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The  research  covers  several  aspects  of  the  basic  issues  that  are 
needed  to  develop  and  implement  nonlinear  filtering  and  control  of 
maneuvering  vehicles  in  uncertain  environments  and  nonlinear 
geometry.  The  research  is  involved  in  modelilng  maneuvering  nonlinear 
vehicles  as  switched  linear  Markov  models.  The  research  therefore 
leads  in  several  directions  investigating  various  aspects  of  such 
models  which  in  general  are  called  hybrid  systems.  Three  different 
aspects  are  considered;  The  first  involves  realization  and  other 
generic  properties  of  hybrid  systems,  including  controllability  and 
stability  as  well  as  simulation  and  averaging.  The  second  involves 
estimation  and  detection  systems  for  hybrid  systems,  including  various 
related  models  and  aproximate  filtering  schemes.  The  third  involves 
the  application  of  switched  Markov  filtering  schemes  to  the  tracking 
of  maneuvering  vehicles. 
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SUMMARY 

This  is  the  final  report  of  Grant  AFOSR-89-0241  with  the  Air  Force  Office  of 
Scientific  Research,  which  is  the  continuation  of  Grant  AFOSR-87-0308  to  the  Georgia 
Institute  of  Technology.  The  research  was  performed  at  Northwestern  University  with 
subcontract  to  Georgia  Tech. 

The  research  covers  several  aspects  of  the  basic  issues  that  are  needed  to 
develop  and  implement  nonlinear  filtering  and  control  schemes  for  hybrid  systems  with 
applications  to  tracking,  guidance,  and  control  of  maneuvering  vehicles  in  uncertain 
environments  and  nonlinear  geometry.  The  research  is  involved  in  modeling  maneuvering 
nonlinear  vehicles  as  switched  linear  Markov  models.  The  research  therefore  leads  in 
several  directions  investigating  various  aspects  of  such  models  which  in  general  are  called 
hybrid  systems.  Three  different  aspects  are  considered:  The  first  involves  realization  and 
other  generic  properties  of  hybrid  systems,  including  controllability  and  stability  as  well  as 
simulation  and  averaging.  The  second  involves  estimation  and  detection  systems  for 
hybrid  systems,  including  various  related  models  and  approximate  filtering  schemes.  The 
third  involves  the  application  of  switched  Markov  filtering  schemes  to  the  tracking  of 
maneuvering  vehicles. 

The  research  culminated  in  the  conclusion  of  two  Ph.D.  thesis  by  J.  Ezzine  and 
M.  A.  Ingram  at  Georgia  Tech  and  one  M.S.  project  by  K.  S.  Lee  at  Northwestern 
University.  It  also  supported  the  initial  stages  of  three  Ph.D.  students,  P.  D.  West.  C.  C. 
Tsai,  and  D.  R.  Shin  who  are  in  various  stages  of  completing  their  Ph.D.  dissertations. 
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SECTION  I 


INTRODUCTION 

The  objective  of  this  research  was  to  develop  nonlinear  filtering  and  tracking 
schemes  for  systems  subject  to  complex  geometries  and  uncertainties.  These  attributes 
characterize  the  air-to-air  engagement  scenario.  The  approach  was  based  on  the 
approximation  of  the  original  nonlinear  stochastic  model  with  a  piecewise  linear  model. 
Then  the  resulting  model  was  further  approximated  by  a  switched  Markov  linear  model. 
The  resulting  model  becomes  then  a  typical  representation  of  hybrid  systems  involving 
both  continuous  and  discrete  dynamics  as  shown: 


X(t)  >  Atrffl]  X(t)  +  B[r(t)l  U(t) 

(1a) 

Y(t)  =  C[r(t)]  X(t)  +  V(t) 

(1b) 

where  the  state  vector  is  X(t).  the  observation  vector  is  Y(t),  U(t)  can  serve  as  either  the 
control  vector  when  considering  a  control  problem,  or  as  the  process  noise  model 
representing  the  model  uncertainties  for  the  filtering  problem,  and  V(t)  is  the  observation 
noise  vector.  The  noise  processes  are  assumed  to  be  white  and  Gaussian.  The  process 
r(t)  is  called  either  the  form  index  or  the  macro-state  process  and  is  assumed  to  be  a 

finite  state  Markov  process  taking  the  values  in  the  set  {1,2 . N}.  The  resulting  system 

is  known  as  either  a  switched  Markov  linear  model  or  a  hybrid  system  model  since  the 
state  X(t)  is  continuous  and  the  vector  r(t)  is  discrete.  The  linear  system  switches  among 
the  finite  number  of  realizations  (A[i],  B[i],  C[il)  depending  on  the  value  of  r(t),  and  the 
switching  follows  a  Markov  chain  rule. 

The  research  reported  under  the  earlier  grants  covered  both  the  analysis  of  the 
switched  Markov  approximation  to  the  modeling  of  nonlinear  systems  as  well  as 
realization  and  characterization  results  on  hybrid  systems.  These  reports  also  discussed 
filtering  schemes  for  such  systems  and  similar  models  that  involve  the  dependence  of  the 
Markov  process  parameters  on  the  system  state. 

This  report  addresses  continuation  of  these  efforts  and  resulted  in  the  conclusion 
of  two  Ph.D.  theses  at  Georgia  Tech.  The  two  completed  theses  covered  two  different 
aspects  of  the  mixed  models  that  include  both  discrete  and  continuous  variables.  The 
first  is  by  Jelei  Ezzine  (Reference  1)  considered  the  properties  of  hybrid  systems  involving 
both  discrete  and  continuous  states  which  in  our  case  reflects  the  switched  linear  Markov 
models  used  to  represent  the  maneuvering  vehicles  to  be  tracked  and/or  controlled.  The 
thesis  studied  the  stability  and  controllability  properties  of  such  systems  and  derived 
conditions  under  which  the  systems  can  be  approximated  by  their  statistical  average 
system.  The  second  thesis  is  by  Mary  Ann  Ingram  (Reference  2)  considered  an 
alternative  model  for  maneuvering  vehicles  and  derived  approximate  filtering  schemes  for 
such  models  that  involve  linear  systems  driven  by  impulsive  inputs  whose  rates  depend 


2 


on  the  state  of  the  system.  Since  exact  filtering  representations  are  not  realizable, 
conditions  for  the  convergence  of  several  detection-estimation  schemes  were  obtained 
and  their  result  validated  via  simulations. 

This  report  concentrates  on  extensions  of  the  results  to  three  areas.  The  first  area 
involves  the  simulation  and  analysis  of  the  multi-modei  approximate  filtering  scheme  that 
has  been  tested  earlier  using  limited  memory  only  and  its  extension  to  three-dimensional 
tracking  filter  for  a  maneuvering  target  and  Is  reported  in  Section  11.  The  second  area, 
covered  in  Section  III,  addresses  the  analysis  and  control  of  hybrid  systems  when  the 
both  the  state  dynamics  and  the  form  index  exhibit  fast  and  slow  modes  of  behavior.  The 
third  area  considers  additional  work  in  the  realization  of  hybrid  systems  and  is  discussed 
in  Section  IV.  The  body  of  each  section  is  relatively  short,  as  the  results  are  provided  in 
appropriate  appendices. 
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SECTION  II 


FILTERING  SCHEMES  FOR  HYBRID  SYSTEMS 

Several  models  and  approximations  have  been  considered  for  the  filtering  schemes 
for  hybrid  systems  and  their  applications.  Ingram’s  thesis  (Reference  2)  considered  a 
continuous  state  model  with  Markov  chain  Input  whose  transition  matrix  depends  on  the 
state  of  the  systems.  Exact  filtering  schemes  cannot  be  derived  analytically  or 
implemented  numerically.  A  new  prior  penalty  approach  to  the  filtering  for  such  systems 
has  been  proposed  and  analyzed  by  Ingram  in  Reference  3  and  is  shown  in  Appendix  A. 
The  resulting  filter  is  superior  to  the  suboptimal  linear  smoother  when  the  rates  of  change 
of  the  Markov  process  are  very  low  and  when  the  impulsive  input  jumps  do  not  take  very 
small  values. 

In  addition  to  the  research  reported  in  Ingram's  thesis,  additional  approximation  to 
the  nonlinear  filtering  structure  reported  in  Reference  4  has  been  proposed.  In  particular, 
the  effort  has  been  centered  at  reducing  the  memory  requirement  of  the  multi-model  filter 
as  well  as  providing  a  more  realistic  simulation  scenario.  In  particular,  an  extension  of  the 
memory  of  the  filter  to  4  steps  has  been  shown  to  provide  a  substantial  improvements 
over  the  single  step  memory  filter,  as  shown  in  Reference  5  and  attached  as  Appendix 
B.  Furthermore,  it  has  been  shown  that  increasing  the  filter  dimension  does  not  result  in 
reduced  performance  as  discussed  in  Reference  6  and  attached  in  Appendix  C.  This 
latest  work  indicates  that  the  filter  is  applicable  to  a  three  dimensional  tracking  problem, 
and  provides  and  alternative  approach  to  the  modeling  of  the  maneuver  acceleration. 
Analysis  methods  to  indicate  the  asymptotic  convergence  of  the  filter  and  its  performance 
are  encouraging. 

Finally  Reference  7  discusses  a  general  framework  for  the  filtering  and  smoothing 
for  systems  with  both  discrete  and  continuous  observation  models.  The  results  are 
primarily  analytical  in  nature  and  the  implementation  issues  have  not  been  resolved  as  yet. 
The  representation  is  shown  in  Appendix  D. 
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SECTION  III 


ANALYSIS  OF  HYBRID  MODELS 

Two  avenues  of  research  have  been  followed  in  the  simulation  and  analysis  of 
hybrid  systems  models.  The  first  established  analytical  and  simulation  tools  for  the  study 
of  how  well  such  models  can  be  used  to  approximate  piece-wise  linear  dynamic  systems. 
Earlier  results  simply  addressed  the  first  order  densities  of  such  models.  In  this  study  the 
autocorrelation  function  of  both  the  model  and  the  original  system  have  been  simulated 
and  compared  to  verify  the  conditions  (earlier  only  studied  in  theory)  for  the  validity  of  the 
approximation.  Furthermore,  an  analytical  approach  has  been  developed  for  the  analysis 
of  the  steady-state  stationary  probability  density  of  the  system  model  and  its  comparison 
to  the  approximating  hybrid  model.  The  results  are  documented  in  an  M.S.  project  by  K- 
S.  Lee  shown  in  Reference  8. 

The  second  continued  the  research  into  hybrid  systems  models  that  involve  both 
fast  and  slow  dynamics.  The  fast  and  slow  dynamics  are  involved  in  both  the  systems 
models  and  in  the  Markov  chain  that  determines  the  transition  among  the  various 
realizations.  Earlier  work  (Reference  9)  was  concerned  with  the  limiting  behavior  of  such 
systems  when  the  Markov  chain  was  either  fast  or  slow.  More  recently,  the  results  have 
been  extended  to  the  case  where  the  Markov  chain  can  be  decomposed  into  groups  of 
fast  transitions.  Furthermore,  asymptotic  results  for  the  convergence  of  the  reduced-order 
models  have  been  derived  for  a  variety  of  cases  of  fast  and  slow  behavior  in  the 
continuous  system  model  and  in  the  underlying  Markov  chain.  The  results  are  given  in 
Reference  10,  and  are  shown  in  Appendix  E.  One  restriction  to  the  resulting 
approximation  is  that  system  matrices  of  the  realizations  involved  in  each  group  of  fast 
transitions  have  to  commute.  More  recentiy.  this  restriction  has  been  successfully 
removed  as  shown  in  Reference  1 1  and  attached  in  Appendix  F.  However,  the  results 
still  require  the  stability  of  each  group  of  realizations.  The  research  also  provide  complete 
analysis  of  the  multiple-time  scale  approximation  for  such  systems  for  both  the  slow  and 
the  fast  dynamics  of  the  system.  The  relative  ratio  of  the  time-scale  of  the  Markov  chain 
transition  matrix  to  those  of  the  continuous  states  is  crucial  to  the  type  of  the  resulting 
approximation. 

Rnally,  the  conditions  for  the  control  and  stabilization  of  hybrid  systems  using  the 
average  model  constant  gain  controllers  or  switched  gains  controller  that  may  depend  on 
the  correct  detection  of  the  macro-state  have  also  been  derived.  Furthermore,  if  we 
assume  that  it  is  not  possible  to  correctly  identify  the  macro-state  (the  value  of  r(t))  of  the 
system,  conditions  on  the  probability  of  detection  errors  have  been  found  that  will  make 
such  a  controller  feasible.  The  results  are  given  in  Reference  12  and  Appendix  G. 

Overall,  these  results  make  it  simpler  to  implement  lower  order  controllers  or  less 
complex  controllers  for  a  variety  of  hybrid  systems  that  either  exhibit  fast  and  slow 
dynamic  responses  or  satisfy  conditions  that  allow  their  robust  control. 
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SECTION  IV 


REALIZATION  AND  CONTROL 

The  section  addresses  several  issues  in  the  realization  models  for  hybrid  systems. 
These  models  can  lead  to  a  more  systematic  approach  to  the  identification  and  control 
of  these  systems.  Canonical  forms  for  the  periodic  hybrid  systems  have  been  developed 
in  Reference  13  and  shown  in  Appendix  H.  The  sensitivity  of  various  realizations  of  hybrid 
systems  have  been  developed  in  Reference  14  and  are  shown  in  Appendix  I.  The 
sensitivity  is  crucial  to  the  efficiency  of  any  identification  or  control  schemes  that  needs 
to  be  used  in  conjunction  with  specific  realization.  A  special  case  of  hybrid  systems  that 
have  linear  relations  among  its  continuous  states  can  be  represented  as  singular  hybrid 
system.  These  systems  may  also  be  considered  as  a  limiting  case  of  singularly  perturbed 
systems  discussed  in  Section  III.  References  15  and  16  discuss  general  approaches  to 
the  realization  problem  of  such  systems  that  have  implication  on  their  control.  The  results 
are  shown  in  Appendix  J  and  Appendix  K.  Rnally.  for  randomly  changing  hybrid  systems 
and  their  underlying  Markov  chains  a  novel  representation  for  the  system  is  given  in 
Reference  17  and  Appendix  L.  Similarly,  a  novel  realization  theory  has  been  proposed 
in  Reference  18  and  shown  in  Appendix  M  for  the  realization  of  Markov  chains  that  are 
crucial  to  the  analysis  and  control  of  hybrid  systems. 


6 


SECTION  V 


SUMMARY  AND  CONCLUSIONS 

The  research  summarized  in  this  report  and  supported  by  the  Air  Force  Grant 
provides  the  basis  for  the  design  of  estimators  controllers  for  systems  subject  to  random 
fluctuations  in  their  models  and  their  environments.  The  controllers  and  estimators  are 
not  optimal  as  it  is  not  possible  to  implement  and  analytically  derive  an  implementable 
form.  Hence,  approximation  methods  have  been  studied  for  the  derivation  of 
implementable  control  scheme  and  filtering  schemes  for  such  systems.  Approximations 
using  slow  and  fast  dynamics  separation  and  reduced-order  modeling  have  been 
proposed  for  such  systems  to  simplify  the  control  and  estimation  implementation.  Rnally, 
applications  to  the  tracking  of  maneuvering  vehicles  have  been  proposed,  the  resulting 
approximate  filter  derived  and  simulated  for  several  one-dimensional  and  three 
dimensional  problems. 
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Abstract 

Smoothing  for  a  linear  system  driven  by  a  point  process  with  a  rate  that 
depends  on  the  state  of  the  system  is  considered.  The  observation  model  is 
the  integrated  version  of  a  linear  combination  of  the  states  in  additive  white 
Gaussian  noise.  A  smoother  that  uses  estimation  and  detection  is  compared 
with  the  optimal  linear  smoother  and  filter.  The  comparison  is  in  terms  of 
the  mean  squared  error  (MSE)  of  the  state.  The  false  alarm  rate  of  the 
detector  is  shown  to  depend  strongly  on  the  region  of  support  of  the  mark 
distribution.  When  false  alarms  are  low,  the  estimation/detection  scheme 
has  lower  MSE  than  the  optimal  linear  smoother.  ‘ 

I.  Introduction 

We  consider  the  state  estimation  problem  for  the  following  single  input  sys¬ 
tem: 

dxt  =  Axidt  +  BdMt,  t  >  0 
with  the  scalar  observation  process 


■ft 


Cxt^s  4-  Vt 


where  ut  is  a  Wiener  process  with  Vrdr.  The  n  x  n  matrix 

A  j  s  .ch  that  the  solution  to  s  =  Ax  is  exponentially  stable.  The  scalar 
process  A/t  is  a  random  jump  process  with  jump  heights,  or  marks,  that  are 
independent  and  identically  distributed  and  with  jump  times  that  occur  with 
an  instantaneous  average  rate  p[xt].  Thus  the  rate  of  jumps  depends  on  the 
system  state.  An  example  that  motivates  this  model  is  a  manuevering  vehicle 
where  a  jump  represents  an  abrupt  change  in  acceleration.  The  likelihood  of 
acceleration  conunands  can  depend  on  the  position  and  velocity  of  the  vehicle. 
Another  example  is  an  electromechanical  system  where  the  jumps  represent 
failures  with  a  likelihood  of  occurrence  that  increases  under  conditions  of 
excessive  heat  or  current.  We  are  interested  in  cases  where  the  rate  of  jumps 
is  low  compared  to  the  bandwidth  of  the  system;  under  this  condition,  the 
state  is  not  well  approximated  by  a  Gaussian  distribution. 

The  process  Zt  is  easily  shown  to  be  in  the  class  of  '^iecewise-deterministic 
Markov  processes,”  defined  by  Davis  [1].  Filtering  and  smoothing  for  systems 
driven  by  Poisson  processes  have  been  considered  by  Kwakemaak  [2,  3]  aind 
Au  [4],  and  for  a  related  process  by  Blom  [5]. 


n.  The  Prior  Penalty  Detector 

This  scheme  uses  observations  over  an  interval  to  detect  the  number  of  jumps 
within  the  interval  and  estimate  the  times  and  marks  of  the  jumps.  The 
state  estimate  is  constructed  by  superimposing  the  system  responses  to  the 
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detected  jumps.  In  order  to  reduce  computational  complexity  and  memory 
requirements,  new  observations  are  used  to  detect  new  jumps  and  update 
only  recently  detected  jumps.  Specifically,  the  observations  over  the  interval 
[/i,i4  +  r),  denoted  by  2x,vi+Ti  are  used  to  detect  the  number  Na,a+t  of 
jumps  in  the  interval  and  to  estimate  the  vector  of  jump  times  ^  and 
marks  of  the  jumps.  Thus  fixed  interval  smoothing  is  performed 

on  the  observations  in  [A,  A  +  T).  Then  the  interval  is  moved  forward  to 
[j4  +  A,  .4  +  A  +  T),  and  fixed-interval  smoothing  is  performed  over  the  new 
interval.  A  detected  jump  that  is  left  behind  by  the  moving  interval  is  called  a 
“finalized  detection.”  Here,  A  is  small  enough  such  that  Pr{iV,4 ,4+^  >  1}  C 
1.  The  system  responses  to  the  finalized  detections  are  superimposed  to 
construct  an  estimate  of  the  state  with  a  fixed  lag.  For  the  sake  of  notational 
simplicity,  the  following  expressions  assume  that  the  estimation  and  detection 
is  performed  on  the  interval  [0,T),  and  that  the  initial  state  zq  is  known.  In 
a  sequential  implementation,  the  interval  is  changed  to  {A,A  +  T)  and  zq  is 
replaced  by  the  smoothed  estimate,  z,4. 

It  is  noted  that  the  maximum  a  posteriori  (MAP)  estimate  of  can 
be  expressed  as 

LMAp{n\ZQT,X(i] 

[  Pr{/^o,T  =  0|zo},  n  =  0 

=  S  ^r.,u,(A{2o.Tll„,«„,iVo,T  =  n,Zo})  (I) 

(  X  Pr{iVo,r  =  nl®o},  n  >  0 

where  AlJ^o^xIlni  Jin>  ■^o.T  =  is  the  likelihood  functioned.  The  detector 
in  the  present  scheme  replaces  the  averaged  likelihood  functional  in  (1)  with 
the  likelihood  functional  evaluated  at  the  MAP  estimates  of  and  u„,  given 
that  Noj-  =  n.  Therefore  the  decision  variable  is 

*0} 

_  r  Pr{lVo,T  =  0|zo},  n  =  0 

\  A{2o.T|inii%.^o,T  =  »*,zo}Pr{iVo,r  =  T»|io},  «  >  0 

We  call  this  scheme  the  Prior  Penalty  Detector  (PPD)  because  the  a  priori 
probablity  Pr{lV’o,r  =  n|zo}  serves  as  a  penalty  for  overfitting  and  can  be 
computed  offline  for  the  desired  range  of  values  for  zq. 

III.  Simulation  Results 

Four  examples  are  used  to  compare  the  performances  of  the  optimal  linear 
filter,  the  PPD,  and  the  optimal  linear  smoother  with  the  same  lag.  The  per¬ 
formance  is  measured  in  terms  of  the  mean  squared  error  (MSE),  normalized 
by  state  variance,  and  the  average  number  of  false  detections  per  true  pulse 
as  a  function  of  noise  variance.  The  MSE  for  the  optimal  filter  and  smoother 
is  computed  using  the  Bode-Shannon  method  [6].  The  MSE  for  the  PPD  is 
found  by  time  averaging  the  computer-simulated  output. 
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All  examples  have  the  same  scalar  system  model  of  dxt  =  —bxtdt  +  dMf 
The  examples  differ  in  the  rate  function  ti[x]  and  the  mark  pdf  pu(u),  as 
shown  in  Figure  1.  This  type  of  rate  function  was  chosen  to  yield  “bursty” 
behavior  in  the  sample  trajectories.  If  the  detector  succeeds  in  detecting  the 
first  few  pulses  that  move  the  state  into  a  high  rate  region,  then  the  detector 
changes  its  characteristics  to  allow  more  detections.  The  pdf’s  were  chosen 
to  illustrate  the  effect  the  mark  pdf  has  on  the  number  of  false  detections. 

The  MSE  results  are  shown  in  Figures  2  through  5  for  Examples  1  through 
4,  respectively.  The  false  alarm  rates  are  shown  in  Figure  6.  We  observe 
that  for  Example  1,  the  PPD  has  a  lower  MSE  than  the  optimal  filter  and 
smoother,  and  has  very  few  false  alarms.  This  is  because  the  region  of  support 
of  the  mark  pdf  is  confined  to  the  positive  axis  and  does  not  permit  arbitrarily 
small  pulses.  In  Example  2,  the  MSE  of  the  PPD  is  only  slightly  lower  than 
that  of  the  linear  smoother.  The  degradation  in  PPD  performance  relative  to 
Example  1  is  due  to  the  increased  number  of  false  alarms  with  small  marks. 
As  the  noise  variance  increases,  the  PPD  makes  about  the  same  number  of 
false  alarms,  but  with  larger  marks.  In  Example  3,  there  is  a  rather  dramatic 
correlation  between  MSE  and  the  false  alarm  rate,  as  both  increase  with  the 
noise  variance.  The  mark  pdf  for  this  example  allows  the  fadse  alarms  to 
have  large  positive  and  negative  marks  that  nearly  cancel.  However,  time 
quantization  in  the  simulation  does  not  allow  such  overlapping  false  alarms 
to  approach  perfect  cancellation  as  the  noise  variance  decreases.  The  mark 
pdf  for  Example  4  was  selected  to  give  the  worst  case  performance  of  the 
PPD  because  it  allows  arbitrarily  small  false  alarm  marks  as  well  as  large 
false  alarms  that  nesiriy  cancel.  Again  the  false  2d2um  rate  is  independent  of 
the  noise  variance,  but  the  rate  is  larger  than  for  Example  2  because  there  is 
no  penalty  for  arbitrarily  small  marks.  Also  the  optimal  linear  smoother  is 
consistently  better  in  terms  of  MSE. 

rV.  Conclusions 

The  simulation  results  indicate  that  if  the  PPD  false  alarm  rate  remains 
below  3  per  true  pulse,  the  PPD  yields  an  MSE  lower  than  the  optimal  linear 
smoother  with  the  same  lag.  It  is  noted  that  these  results  are  somewhat 
biased  in  favor  of  the  linear  estimators.  One  reason  is  that  the  MSE  for 
the  linear  smoother  and  filter  are  evaluated  using  a  formula  that  assumes  an 
infinite  observation  interval,  rather  than  an  interval  of  length  equal  to  the  lag 
as  in  the  PPD.  Another  reason  is  that  although  the  ratios  of  (jump  rate  times 
mean  squared  mark  value)-to-(system  bandwidth  times  noise  variance)  for 
our  examples  were  useful  for  studying  false  alarm  behavior,  they  guaranteed 
good  performance  for  the  linear  estimators.  To  see  poorer  performance  by 
the  linear  estimators  and  better  performance  by  the  PPD,  one  should  reduce 
this  ratio. 
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Figure  2.  Normalized  mean  squared  error  for  Example  1. 


Figure  2.  Normalized  mean  S4tuared  error  for  Example  2. 


Figure  4.  Normalized  mean  squared  error  for  Example  3. 


Observation  Noise  Strength,  V 

Figure  S.  Normalized  mean  squared  error  for  Example  4. 


using  the  PPD. 
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ABSTRACT 

The  Kalman  filter  provides  optimal  state  estimates  for 
completely  Icnown  linear  systems.  Unfortunately,  many  physical 
systems  arc  neither  exactly  kitown,  nor  linear.  Numerous  Altering 
schemes  for  nonlinear  systems  have  been  introduced  over  the 
years:  general  theories  for  nonlinear  systems  tend  to  be  complex, 
aixl,  due  to  their  generality,  are  of  little  pracAcal  use  to  the  design 
engineer.  On  the  other  hand,  solutions  for  speciAc  nonUnearities 
usually  apply  only  to  a  single  nonlinearity,  arid  thus  are  limited  in 
their  applications.  This  paper,  however,  presents  a  methodology 
whereby  the  norUinearity  is  Arst  approximated  by  a  piecewise 
linear  model,  and  tlien  a  common  Altering  scheme  is  applied.  The 
ef  Acacy  of  this  approach  is  that  the  same  Altering  algorithm  may 
be  applied  to  a  broad  class  of  nonlinear  siochasAc  systems. 

I  INTRODUCTION 

SpcciAcally,  the  problem  at  hand  assumes  that,  given 
nonlinear  observauons  y{k),  it  is  desired  to  esumate  the  state  x{k) 
of  the  system 

+  (1) 

where  x,  is  the  n  vector  representing  the  system  state  at  time  k.  w, 
is  a  whim,  discrete  ume  /-dimensional  veemr  Gaussian  random 
process  with  covariance  matrix  Q,  and  b  is  an  n  x  /  dimensional 
matrix.  The  observation  model  is  assumed  to  be  given  by 

y,  =  /i(i,)  +  v,  (2) 

where  y,  is  an  /n-dimensional  vector  which  represents  the 
observauon  at  time  k.  and  v,  is  an  m-dimensional  white  Gaussian 
measurement  noise  process  with  covariance  R.  The  state 
propagation  Ainction  g{  )  and  the  observation  funcuon  h(-)  are 
allowed  to  be  nonlinear. 

The  nonlincarities  in  (1)  and  (2)  ate  approximated  by  the 
continuous  piecewise  linear  approximation  given  by  the  following 


model: 

g(x)  =  G,x*g, 

for 

xe  fl,. 

1  =  1. 

...M, 

(3) 

and 

Au )  = /f.x A, 

for 

xe 

1  =  1. 

W 

where  {Q,, }  and{G« }  partition  the  state  and  measurement  spaces. 
For  simplicity  of  notation,  the  cross  pttxlua  of  the  two  parddons 
may  be  formed  to  yield  one  parduon, 

{Q, },  1  =  I . M-.  M  <  At  each  time  step  k,  the  system  is 

as.sumcd  to  be  governed  by  one  of  the  M  models.  These  M  system 
models  arc  called  sysutm  macro-states. 

The  Anal  model  assumption  is  that  the  system  jumps  from 
macro-stau:  i  to  macro-state  j  according  to  a  Anite  state  Markov 
process.  J.  In  order  U)  mainuin  a  "memory”  of  the  last  r  time 
steps,  a  parameter /(I: )  is  inutiduced.  where  J{k)  represents  the  set 
of  the  .VT  macro-sDtes,  i.e., 

=  . (5) 

where 

j,6{1.2 . M]  (6) 

Conceptually,  the  new  Alter  is  based  on  two  assumptions  for 
the  sysuim  model;  I )  The  nonlincarities  may  be  approximated  by 
continuous  pieccwi.se  linear  funcuons  and  2)  This  N-segment 
piecewise  linear  model  may  be  approximated  by  S  separate  afAnc 
systems  driven  by  the  same  process,  with  the  true  system  output 
being  approximated  by  randomly  selecting  one  of  die  N  outputs. 
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Conditions  supporting  the  validity  of  these  assumptions  are 
derived  in  [1].  The  second  assumption  allows  application  of  the 
theory  of  switching  systems.  The  optimal  (albeit  non-realizable) 
Alter  for  switching  systems  was  introduced  by  Ackerson  and  Fu  in 
19’’0  [2],  and  consists  of  a  likelihood-weighted  sum  of  Kalman 
Alters  "tuned"  to  each  possible  switching  sequence,  and.  hence, 
involves  exponendally  increasing  complexity  with  time.  Their 
paper  did  not  consider  the  additional  structure  present  in  the 
hybrid-state  model  where  the  dynamics  of  the  system  macro-state 
are  independent  of  the  system  state,  but  where  the  sysrem  state  is 
not  independent  of  the  macro-state.  The  optimal  Alter  for  this 
hybrid-state  model  was  presented  by  Bmneau  and  Tenney  in  [3|. 
It  is  shown  that  this  Alter  is  also  inAnite  dimensional  and 
nonrealizable.  Numerous  schemes  have  been  introduced  to 
reduce  the  Alter  complexity  by  casting  away  unlikely  trajcctoncs 
or  combining  similar  estimates  (see  e.g.  [4]). 

A  primary  difference  between  this  work  aiKl  that  of  [2]  is  that 
here,  in  the  underlying  piecewise  linear  model,  the  system 
macro-state  is  a  function  of  the  system  state.  This  fact  is  exploited 
in  the  Altering  algorithm  through  the  consistency  update  stage.  A 
consistency  update  occurs  when  the  state  esumate  for  the  t" 
model,  if,  is  compared  with  the  domain  of  the  i"*  lirte  segment  (in 
the  scalar  case).  If  the  state  esUmate  produced  by  a  given  Alter  is 
not  within  the  domain  of  the  line  segment,  then  the  state 
estimate  is  said  to  be  inconsistent  with  its  macro-state,  and  less 
weight  is  placed  on  that  esAmate. 

n  HLTERING  SCHEME 

The  Altering  scheme  applied  here  consists  of  maintaining  one 
Kalman  Alter  "tuned"  to  each  of  Ate  AT  macro-sute  trajectories. 
Thus,  for  each  new  oteervaAon.  an  enAre  set  of  AT  innovauons 
will  be  formed  -  one  for  each  Alter.  Next.  Ate  usual  measurement 
and  Arne  updates  will  be  perfomied  for  each  Alter,  producing  Sf 
covariaiKe  matrices  and  individual  state  esAmates.  Next,  the 
overall  combined  esAmate.  i\  is  fomted  from  Ak  likelihood 
weighted  sum  of  Atese  AC  individual  esAmates.  Finally.  Aie  Alters 
are  aggregated,  and  the  condiAonal  probabiliAes  and  likcAhood 
AiiicAons  are  modlAed  according  to  the  consistency  update  stage. 
A  detailed  summary  of  Aiese  steps  is  provided  here.  AddiAonal 
details  of  Ak  algoriAim  may  be  found  in  [5]. 

Before  describing  Ak  individual  AlKring  steps,  some 
addiAonal  deAniuons  are  required.  The  .Markov  uansiAon  matrix, 
n.  specifying  Ak  transidon  probabiliAes  from  macro-stare  5,  to  5^ 
is  obtaitKd  from; 

n,,  =  Crfi,.,  6  n,  (z,  6  Q. }  (7) 

The  marginal  steady-state  probabiliAes  p,  of  macro-sute  5,  are 
deAned  by  the  soluAon  to; 

p  =  pn  (8) 

where  p  is  a  row  vector  with  components  p,.  The  a  posteriori 
probability  Aiat  Ak  system  is  in  macro-sure  i  at  Arne  k  may  be 
expressed  as 

/!,(*)=  S  A,„„(l:)  (9) 

r»-i) 

where  J(k;i)  denotes  all  M'"'  sequences  at  time  k  which  end  in 
macro-sure  i. 

Consistency  Update 

If  the  variaiKe  of  any  individual  esAmate.  P,n  -  n.  is  small  then 
Ak  infotmauon  provided  by  fi  may  be  neglected.  In  Aiis  case. 
AKse  values  are  changed  based  on  Ak  posiuon  of  the  estimate 
in  Ak  appropriate  region  G,.  and  used  to  update  Ak  a 
posteriori  macro  state  probabiliAes  fi,{k  I  J(k  -  D).  In  turn,  these 
are  used  in  the  rrexi  suge  for  updaAng  pXk  -rill;).  If,  on  Ak  oArer 


hand,  the  individuaJ  estimaic  covariance  is  lai^ge,  the  macro  state 
infonnation  is  weighted  more  heavily  in  determining  the  macro 
state  piobabilitics.  In  this  work,  this  updating  stage  was  achieved 
through  the  following  equation: 

fi,(k  \J{k-\))  =  (!(/>,(,. „)p,(i  17(4 -  1))  + 

(10) 

Here.  aU.P )  is  a  function  of  the  norm  q(P  which  tends  to  zero  as  P 
becomes  small,  and  which  tends  to  unity  as  P  becomes  large.  The 
operator  UXx)  is  an  indicator  ftinction  that  is  equal  to  unity  if 
z  €  Q,,  and  is  zero  otherwise. 

Time  Update 

The  macro  state  probabilities  are  updated  by  using  the 
consistency  updated  values  together  with  the  transition 
probabilities, 

+  >  I  4)  =  p74  I l))A/<,.,)(*)nj,  (II) 

Time  updates  of  the  individual  state  and  covariance  estimates  are 
achieved  via  the  standard  Kalman  filter  equations  for  the 
appropriate  models. 

Measurement  Update 

As  above,  the  individual  state  estimate,  the  innovations,  and 
the  covariance  may  be  calculated  using  the  Kalman  filter  for  the 
appropriate  model  under  consideration.  The  question  now  is 
concerned  with  the  measurement  update  of  the  macro  state 
probability  estimates.  This  can  be  accomplished  by  using  the 
standard  likelihood  function  for  a  switched-Markov  model, 
which,  it  should  be  noted,  is  only  an  approximation  in  this  case. 
The  expression  for  the  a  posteriori  probabilities  in  this  case  will 
be  proportional  to  the  likelihood  functions  A,(,.,j,(4).  The  update 
equation  is 

"t"  1)  =  0A^pi,,jj(4  1  1  4)x 


in  ANALYSIS 

Since  the  filter  is  complex  and  nonlinear,  it  has  yet  to  succumb 
to  any  closed-form  performance  analysis  techniques.  Hence. 
Monte-Carlo  simulation  techniques  were  used  to  assess  its 
performance.  In  the  simulation,  a  scalar  version  of  the  proposed 
filter  with  memory,  r,  of  length  one  (PFl)  and  four  (PF4)  is 
compared  to  a  standard  Extended  Kalman  Filter  (EKF).  Both  the 
system  function  (1)  as  well  as  the  measurement  function  (2)  are 
defined  by  3  segment  affine  maps  g{x)=‘Ci  +  k,  and 
h(x)  =  Hx  +kt,  where: 


2.0, 1 X  |<  1 

-0.2,|x|>l 


5.0.|x|<0.5 

-0.1,|x|>0.5 


(H) 


J2.2sgn(x),  lxl>l  f2.55sgn(x).  1j:I>0.5 

IxlSl  *’  =  l0,  |x|<l 

Both  Q  and  R  were  set  to  unity  and  was  varied  from  I  to  10. 
Figure  1.  is  a  graph  which  depicts  the  rei.'itive  error  variance  as  a 
fiinction  of  b  parameterized  by  the  filter  type  (PFl,  PF4,  or  the 
EKF).  As  can  be  seen,  the  improvements  between  cither  of  the 
proposed  filters  and  the  EKF  is  striking.  Additional  simulations 
(not  presented  here)  indicate  similar  trends,  with  the  best 
performance  increases  being  seen  for  non-injective  nonlinearities. 


exp|-5V'/(,.,„(l:  +  -t- 1)  (12) 

where  0  is  a  nomialization  coefficient,  the  are  the 

iniKhrations  processes  arising  from  the  Kalman  filter  tuned  to  the 
J(k  Ur)  model,  and  A  represents  the  consistency  updated 
likelihood  value. 

Combined  Estinute 

The  combined  estimate  x'‘(4)  is  obtained  by  using  the 
likelihood-weighted  sum  of  the  individual  estimates,  i.e. 

i'(4)=lA,a,(*lx,(„(4)  (13) 

Aggregation 

To  avoid  expanding  memory,  it  is  necessary  to  reduce  the 
number  of  filters  at  each  time  sti^.  This  may  be  achieved  in  a 
number  of  ways  including  casting  away  unlikely  sequences, 
merging  similar  sequences,  or.  the  preach  taken  here, 
systematically  aggregating  at  the  earliest  time. 

The  technique  developed  for  reducing  the  number  of  filters 
required  to  Af,  is  as  follows:  consider  the  collection  of 
macro-state  sequences  ,7^(4)  to  be  the  sequences  (of  length  r)  at 
time  4.  that  began  in  macro-sute  i.  and  progressed  to  macro-state 
y  at  the  next  time  step.  Similarly,  the  notation  ,7(4)  indicates  the 
set  of  all  sequences  that  began  in  macrostate  4.  The  aggregation 
step  involves  forming  likelihood  weighted  sums  over  the  index  i 
for  each  ;.  thus  reducing  the  filter  memory  by  one. 

I  ,  0\  •  j  •  I 
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Figure  1.  Filter  State  Estimate  Error  Variance  Performance 

IV  SUMMARY  .\ND  CONCLUSIONS 
This  paper  presents  a  new  sub-optimal  filter  to  be  used  for  the 
nonlinear  estimation  problem  in  systems  with  piecewise  linear 
models  in  both  the  system  and  observation  equations.  The 
approximations  used  are  based  on  utilizing  the  switched-Markov 
.model  for  the  system  as  well  as  on  modifying  the  resulting  filter 
with  the  physical  constraints  of  the  states  of  the  model.  Not  all 
facets  of  the  filter  are  in  final  form,  and  work  remains  in  the  area 
of  the  exaa  formulation  of  the  consistency  update,  as  well  as  filter 
aggregation.  Nonetheless,  preliminary  results  show  that  the  filrer 
may  work  well  in  a  broad  class  of  norilinear  filtering  problems. 
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The  covariance  is  updated  using  the  Gaussian  sum  approximation, 
i.e. 

tt  .  . 
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ABSTRACT 

A  new  filtering  concept  is  presented  for  tracking  maiKuvering 
targets.  A  conventional  Maikov  switching  process  is  used  to  model  the 
target  maneuver  process,  but  a  new  filtering  scheme  is  employed.  The 
filter  uses  a  traditional  track-splitting  approach,  with  one  Kalman  filter 
tuned  to  each  branch  of  the  tree.  To  limit  filter  complexity,  aggregation 
is  performed  over  the  eadiest  timestep  of  an  arbitrary  filter  memory 
length.  Before  aggregation,  a  unique  consistency  update  stage  is 
employed  where  each  of  the  filter’s  state  estimates  is  compared  with  the 
associated  conditional  model  for  that  filter.  If  the  two  are  inconsistent, 
(e.g.  a  large  acceleration  component  generated  from  a  non-maneuvering 
model),  less  weight  is  placed  on  that  estimate.  Results  are  presented 
from  a  full  3-D  tracking  model. 

I  INTRODUCTION 

The  coiKept  for  the  filtering  scheme  preseiued  here  arose  frixn  a 
nonlinear  filtering  algorithm  presented  earlier  [1].  In  the  nonlinear 
filtering  application,  the  nonlinearity  was  first  approximated  by  a 
piecewise  linear  model.  After  that,  a  filter  based  on  a  switching  model 
was  developed.  In  the  following  work,  the  maneuvering  target  problem 
is  naturally  described  by  a  switching  system,  allowing  for  the  application 
of  a  similar  filter  structure. 

Specifically,  the  problem  at  hand  assumes  that,  given  observations 
y{k),  it  is  desired  to  estimate  the  state  x(k)  of  the  system 

x..,=Ax.-vr,(>v.-^u,,)  (1) 

where  X,  is  the  n  vector  representing  the  system  state  at  time  k,  w,  is  a 
white,  discrete  time  /-dimensional  vector  Gaussian  random  process  with 
covariance  matrix  Q.  u,is  a  deterministic,  but  unknown  control  input, 

and  Ft  is  an  n  X  /  dimensional  matrix.  The  observation  model  is  given  by 
y,  =  Hx,  +  v,  (2) 

where  y,  is  an  m-dimensional  vector  which  represents  the  observation  at 
time  k,  and  v,  is  an  m-dimensional  white  Gaussian  measurement  noise 
process  with  covariance  K. 

For  the  maneuvering  target  problem,  we  partition  the  acceleration 

comnoncnt  of  the  state  space  into  Af  regions,  {Qi),i  =  1 . M.  The 

model  allows  the  system  to  jump  from  maneuver  state  (control  input)  t 
to  maneuver  state  j  according  to  a  finite  state  Maikov  process,  J.  In 
order  to  mairaain  a  "memory''  of  the  last  r  time  steps,  a  parameter /(k)  is 
introduced,  where  J(k)  represents  the  set  of  the  AT  maneuver  stales,  i.e., 
{-/(k)}  —  (3) 

where 

>,€{1.2 . Af)  (4) 

In  this  work,  the  acceleration  partition  width  was  set  to  -  I. 

where  represents  the  maximum  acceleration  to  be  modelled. 

The  optimal  (albeit  non-realizable)  filter  for  switching  systems  was 
introduced  by  Ackeison  and  Fu  in  1970  [2],  and  consists  of  a 
likelihood-weighted  sum  of  Kalman  filters  "miKd"  to  each  possible 
switching  sequence,  and,  hence,  involves  exponentially  increasing 
complexity  with  time.  Numerous  schemes  have  been  introduced  to 
reduce  the  filter  complexity  by  casting  away  unlikely  trajectories  or 
combining  similar  estimates  (see  e.g.  [3]). 
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A  primary  difference  between  this  work  and  that  of  [2]  is  that  here, 
in  the  underlying  model,  the  system  maneuver  state  is  a  ftirxxion  of  the 
system  state.  This  fact  is  exploited  in  the  filtering  algorithm  through  the 
consistency  update  stage.  A  consistency  update  occurs  when  the  state 
estimate  for  the  f*  model,  i„  is  compared  with  the  domain  of  the 
maneuver  commarvl.  If  the  acceleration  componeiu  of  the  state  estimate 
produced  by  a  given  filter  is  not  within  the  domain  of  the  /’*  region,  then 
the  state  estimate  is  said  to  be  iiKonsistent  with  its  maneuver  state,  and 
less  weight  is  placed  on  that  estimate. 


n  FILTERING  SCHEME 

The  filtering  scheme  applied  here  consists  of  maintaiiting  one 
Kalman  filter  "tuned"  to  each  of  the  AT  maneuver  state  trajectories. 
Thus,  for  each  new  observation,  an  entire  set  of  AF  iimovations  will  be 
formed — one  for  each  filter.  Next,  the  usual  measurement  and  time 
updates  will  be  performed  for  each  filter,  producing  AF  covariance 
matrices  and  iitdividual  state  estimates.  Next,  the  overall  combined 
estimate,  i*,  is  formed  from  the  likelihood  weighted  sum  of  these  AF 
individual  estimates.  Finally,  the  filters  are  aggregated,  and  the 
conditional  probabilities  and  likelihood  functions  are  modified 
according  to  the  consistency  update  stage.  A  detailed  summary  of  these 
steps  is  provided  here.  Additional  details  of  the  algorithm  may  be  found 
in  (4]. 

Before  describing  the  individual  filtering  steps,  some  additional 
defittitions  are  required.  The  Maikov  transition  matrix,  n.  specifying  the 
transition  probabilities  from  maiteuver  state  5,  to  Sy  is  obtained  from: 

rV»Fr{«j.,6  Oylu.e  Q,}  (5) 

The  a  posteriori  probability  that  the  system  is  in  maneuver  state  r  at 
time  k  may  be  expres^  as 

/>.(*)=  I  Ay,,^(k)  (6) 


where  J(k:i)  denotes  all  AF"'  sequences  at  time  k  which  end  in 
macro-state  r. 

Target  Model 

The  explicit  state  equations,  assuming  a  1  second  update  rate,  are  as 
follows: 
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where  p,  v,  arxl  a  represent  the  targets'  position,  velocity  and 
acceleration,  respectively.  Although  numerous  coordinate  systems  exist 
for  target  tracking,  polar-spherical  (measurement)  coordinates  have  been 
selected  for  this  analysis.  Using  well-known  [5]  approximations,  this 
formulation  leads  to  tluee  loosely  coupled  state  equations — one  each  for 
range,  elevation,  arxl  bearing.  This  leaves  us  with  three  3x3  systems 
ratiierthan  oik  9x9.  The  impaa  of  this  is  more  important,  though,  when 
the  complexity  introduced  through  multi-model  approach  is  considered. 
IfN  different  marreuver  commands  are  modelled  for  each  axis,  then  there 
are  s’  possible  systems  for  the  coupled  filter,  and  only  3N  for  the 
decoupled  case.  When  a  memory  of  Af  timesteps  is  admitted  then  we 

have  Af*’  possible  systems  for  the  coupled  case  and  only  3N*‘  for  the 
decoupled  case.  If  S  maiKuver  commands,  and  a  memory  of  3  timesteps 
are  considered,  the  decoupled  filter  requires  less  than  two  percent  of  tire 
complexity  of  the  fully  coupled  system.  Thus,  for  the  same 
compuutional  complexity,  many  more  maneuver  commands  could  be 
added  to  the  decoupled  filter. 

In  summary,  then,  data  are  measured  in  the  spherical  coordinates, 
range,  elevation  and  bearing.  Iitdependent  filtering  is  performed  in  each 
coordinate  dimension  by  the  ikw  filtering  scheme.  A  single  estimate  is 
produced  in  each  of  the  three  dimensions,  at  each  timestep. 
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Consistency  Update 

If  the  variance  of  any  individual  estimate,  is  small  then  the 
infonnation  provided  by  fi  may  be  neglected.  In  this  case,  these  values 
are  changed  based  on  the  position  of  the  estimate  in  the 

appropriate  region  Q,,  and  used  to  update  the  a  posteriori  macro  state 
probabilities  \  J(k  - 1)).  In  turn,  these  are  used  in  the  next  stage  for 
updating  fixk  -*-11^).  If,  on  the  other  hand,  the  individual  estimate 
covariance  is  large,  the  macro  state  infonnation  is  weighted  more  heavily 
in  determining  the  macro  state  probabilities.  In  this  work,  this  updadng 
stage  was  achieved  through  the  following  equation: 
p,(k  \J{k  -  l))  =  a(f»,,.„)|i,(i  \J{k  - 1))  + 


Here,  a(P)  is  a  fttnction  of  the  norm  of  F  which  tends  to  zero  as  F 
becomes  small,  and  which  tends  to  unity  as  F  becomes  large.  The 
operator  U,{x)  is  an  indicator  ftnction  that  is  equal  to  unity  if  2  e  Q,,  and 
is  zero  otherwise. 

Time  Update 

The  macro  state  probabilities  are  updated  by  using  the  consistency 
updated  values  p,  together  with  the  transition  probabilities, 

A,, ,  „#  +  1 1  *)  =  pjik  \J(K-\  M,, .  „(l:)n^,  (9) 

Time  updates  of  the  individual  state  and  covariance  estimates  are 
achieved  via  the  standard  Kalman  filter  equations  for  the  appropriate 
models. 

Measurement  Update 

As  above,  the  individual  state  estimate,  the  innovations,  and  the 
covariance  may  be  calculated  using  the  Kalman  filter  for  the  appropriate 
model  under  consideration.  The  question  now  is  concerned  with  the 
measurement  update  of  the  macro  state  probability  estimates.  This  can 
be  accomplished  by  using  the  standard  likelihood  function  for  a 
swiiched-Markov  model.  The  expression  for  the  a  posteriori 
probabilities  in  this  case  will  be  proportional  to  the  likelihood  functions 
The  update  equation  is 

+  1 )  *  PAyd ,  +  1 1  k)  X 


xp{-| 


(10) 


where  P  is  a  normalization  coefficient,  the  are  the  innovations 
processes  arising  from  the  Kalman  filter  tuned  to  the  /(k  1;/)  model, 
and  A  represents  the  consistency  updated  likelihood  value. 


Combined  Estinute 

The  combined  estimate  i*(k)  is  obtained  by  using  the 
likelihood-weighted  sum  of  the  individual  estimates,  i.e. 

f'(k)=ZA„/kX,a^k)  (11) 

rm 

Aggregation 

To  avoid  expanding  memory,  it  is  necessary  to  reduce  the  number  of 
filters  at  each  time  step.  This  may  be  achieved  in  a  number  of  ways 
including  casting  away  unlikely  sequences,  merging  similar  sequerKes, 
or,  the  approach  taken  here,  systematically  aggregating  at  the  earliest 
time. 

The  techrtique  developed  for  reducing  the  number  of  requited  filters 
to  AC.  is  as  follows:  consider  the  collection  of  macro-state  sequences 
,//(k)  to  be  the  sequences  (of  length  r)  at  time  k,  that  began  in 
macro-state  i,  and  progressed  to  macro-state  y  at  the  next  time  step. 
Similarly,  the  notation  ,J{k)  indicates  the  set  of  ail  sequetKes  that  began 
in  maneuver  state  k.  The  aggregation  step  involves  forming  likelihood 
weighted  sums  over  the  index  i  for  each  j,  thus  reducing  the  filter 
memory  by  one, 

n 

/a)  (12) 

The  covariarKe  is  updated  using  the  Gaussian  sum  approximation,  i.e. 

=  1  A  +  (13) 

and 
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m  RESULTS 

The  filter  was  implemented  on  a  digital  computer  using  S  maneuver 
commands  per  axis  arxl  3  timesteps  of  memory.  Further,  a  simple 
3-state,  3-axis  Kalman  filter  was  implemented  as  a  first-cut  benchmark. 
The  Kalman  filler  noise  covariance  parameters  and  the  parameters  of  the 
new  filter  were  set  equal  Realistic  flightpaih  data  were  generated  using 
the  Air  Force  BLUEMAX.n  flightpath  geiKrator  program,  with  the 
F-16A  aircraft  characteristics  file.  Space  here  allows  inclusion  of  only  a 
single  tracking  perfoimartce  example.  For  this  example,  the  target  flies 
along  the  X-axis  at  a  speed  of  500  fps  until  it  reaches  z=2000  feet  At 
this  point,  the  afterburner  is  turned  on  and  the  target  initiates  a  strong 
climbing  left-hand  turn.  In  figure  1.  the  acmal  trajectory  is  highlighted 
with  an  X  on  each  data  point  The  trajectory  highlighted  with  diamonds 
shows  the  perfonnance  of  the  simple  Kalman  filter,  while  the  trajectory 
indicated  with  circles  shows  the  performaiKe  of  the  tkw  filter.  As  can 
be  seen,  the  transient  behavior  of  the  new  filter  is  superior. 


rv  SUMMARY  AND  CONCLUSIONS 
This  paper  presents  a  new  filler  to  be  used  for  tracking  maneuvering 
targets.  Not  all  facets  of  the  filler  are  in  final  form,  arxl  work  remains  in 
the  area  of  the  exact  formulation  of  the  consistency  update,  as  well  as 
filter  aggregation  and  filter  tuning,  or  parameter  selection.  Nonetheless, 
preliminary  results  are  promising.  Clearly,  the  next  stage  of  the  research 
should  address  benchmarking  the  proposed  filter  against  some  other  well 
accepted  multi-model  filter  such  as  the  interacting  multiple  model  filter 
16). 
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Abstract-The  filtering,  smoothing,  and  prediction  prob¬ 
lems  for  mixed-type  states  and  observations(cantinuous  plus 
discontinuous)  are  considered.  Normalized  and  unnormal¬ 
ized  forms  for  the  corresponding  three  types  of  estimates  are 
obtained,  which  are  formulated  in  the  unified  frameworks. 

1  Introduction 

Elliott  and  Antonelli  [l,2j  discussed  the  smooching  and 
prediction  problems  for  Wiener-type  observations  in  terms 
of  semimartingale  decompositions  and  measure  transforma¬ 
tions.  Analogous  problems  for  counting  observations  were 
treated  in  [3j  and  [4]  independently.  We  extend  these  i.  as 
to  general  estimation  problems  where  both  the  signal  and 
the  observation  consist  of  Wiener  processes  and  counting 
processes,  and  moreover  there  exists  dependence  between 
signal  and  observation  noise. 

For  the  nonlinear  filtering  for  these  types  of  general  prob¬ 
lems,  Gertner[5|  obtained  optimal  conditional  expectations 
baaed  on  a  measure  transformation  and  a  Fubini-type  the¬ 
orem.  He  constructed  a  new  equivalent  reference  measure 
under  which  he  derived  normalized  and  unnormalized  dis¬ 
tributions  directly  from  the  definition  of  conditional  expec¬ 
tations  and  Bayesian  formulas,  not  relying  on  Innovation 
methods  and  appropriate  transformations. 

Based  on  Gertner’s  general  model,  we  will  obtain  nor¬ 
malized  and  unnormalized  estimates  for  the  smoothing  and 
prediction  problems  in  a  different  way.  Our  approach  is  first 
to  derive  the  normalized  forms  using  innovation  methods, 
then  the  unnormalized  forms  indirectly,  and  finally  direct 
derivations  of  the  unnormalized  forms  will  be  made.  This 
paper  shows  a  tmified  approach  to  general  nonlinear  estima¬ 
tion  for  smoothing  and  prediction  problems. 

One  example  of  such  a  mixed  type  observation  was  given 
by  Hoversten  et.al.  [6]  who  considered  that  in  optical  com¬ 
munication  receivers,  the  detector  output  currents  could  be 
modelled  as  stochastic  processes.  Those  processes  contain 
doubly  stochastic  Poisson  processes  due  to  photoelectron 
and  dark  current  and  a  Wiener  process  due  to  thermal  noise. 

General  terminology  and  assumptions  are  presented  in 
the  next  section. 

2  Notation  and  Preliminaries 

Let  (ft,  7 ,  P)  be  a  complete  probability  space  and  let 
{/(,{  €  [0)^]}  be  a  nondecreasing  family  of  a  -fields  of 
7  such  that  7i  is  right  continuous  and  7o  contains  all  null 
sets.  All  stochastic  processes  are  defined  on  {n,7,P]  and 
a  finite  time  Interval  [O.T],  and  are  scalar-valued.  We  de- 
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note  the  class  of  square  Integrable  (local)  martingales  by 
■M’(^,P)(resp.  MtM{7t,P)).  Assume  that  the  signal  and 
observation  processes  have  the  following  representation 

Xt  =  Xo  +  /  /udu  -t-  Mf  -f-  Hi, 

Jo 

Vt—f  /ludu  W,  -I-  JV). 

where  fi  and  ht  are  square-integrable  ^-adapted  processes, 
Mi,Wi  €  M^(7i,P)  with  <  W  >(=  f,  and  Hi  and  jV,  are 
integrable  counting  processes  such  that 

Qi  =  Hi- f%„du&M\7i,P),  (1) 

qi  =  N,-  f  Xiidu  e  M*(/„P).  (2) 

Jq 

where  'll,  A*  are  square-integrable  non-negative  /^-predictable 
processes.  We  assume  that  there  exist  ,^-predictable  pro¬ 
cesses  at,  bt  such  that 


=  /  o^du, 
vO 


<M,W  >, 


(F.jv],  =  (g,9i.  = 

JO  JO 

Next  we  describe  a  measure  transformation  which  plays 
a  key  role  in  obtaining  the  unnormalized  distribution. 

Define  a  measure  Po  equivalent  to  P  on  (H,  7)  by 

^  =  exp[-  hlda~  ]n{K)dN,+J^{X,~l)du\. 

By  Girsanov’s  theorem,  P  a  absolutely  continuous  with  re¬ 
spect  to  Po  with  Radon-Nlkodym  derivative 

+  l\{X^)dN^  -  jr’’(A.  -  l)dul. 
where  Wt  =Wi  +  Jo  h„du.  It  can  be  shown  that 

A,  =  expij*  h^dSV,,-^  J‘  hldu+f*  ln(A„)dW«- J\x„-l)du] 

“  0  0  “  (3) 

is  a  {Tu  such  that 

Furthermore  by  the  differential  rule,  A|  satisfies  the  integral 
equation 

A,  =  1  i.„h„d(V„  +  f  A«-(A„  -  l)<l(iV,  -  u).  (4) 

Jo  Jo 

Then  under  the  measure  transformation  the  observation  pro¬ 
cesses  are  simplified. 
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W,  =  W,+  f*  h^du  e  M*[Tt,Po), 

Jo 

*=$•+/  {K-  l)du  =  JV,  - 1  e  M*[7t,Po). 

Jo 

Note  that 

<  W  >,=<  W  >,=  t,  (51,  =  [?1.  =  N.  =  JV,. 

under  both  measures.  Similarly,  the  signal  process  is  also 
changed  under  the  new  measure  Po 

M  =  W.  +  f\(iudn  €  X*(/.,Po), 

=  <3«  +  f  “  l)fc«du  €  M*(^,Po). 

Jo 

Thus  imder  Pq  the  signal  and  observation  process  are  of  the 
form 

Xt  =  Xyj+  f  fudu  —  f  h^a^dii  +  M«  +  /  b^dtt 
Jo  Jo  Jo 

-  f  A«6udtt  +  Qi+  (  'tudti,  (5) 

Jo  Jo 

Y,=Wt  +  N,.  (6) 

It  is  also  known  that  Xt  has  the  same  distribution  under 

either  measure  [7]. 

The  following  theorem(Fubini-type  theorem  for  stochas¬ 
tic  integral)  is  of  critical  importance  in  the  derivation  of  the 
optimal  conditional  distribution.  It  provides  conditions  for 
interchanging  conditional  expectation  and  stochastic  into* 
gration  [S]. 

Theorem  1:  Let 


Mi,  V,  €  M\7„P),  Si  =  e{V„  s  <  <). 


Then 

E[Mi  1  =  /  4,dV, 

(T) 

where 

d<M,V>, 

**  d<V>i 

is  an  ^predictable  process,  and  is  a  ^rPredictable  pro¬ 
jection  of  Oi  onto  S*  *'tch  that 


E  iid<V  >^<oo. 

As  special  cases,  (l)  if  MW  then  E\Mi  \  ^i|  =  0. 

(2)  if  /« is  a  s<iuare-integrabie  ^-pre^ctable  process,  and  7i 
and  ^(((  >  s)  are  conditionally  independent  given  S** 

E[f^  fndY,\Si]  =  fjn<ar>  («) 

where  is  the  predictable  projection  of  /•  on  Qt-O 

Since  independent  increment  processes  Wi,  iVf  satisfy  (2), 
we  get  known  re8ults[5,6]: 

E[ f*  I  7^]  =  J*  E[K  I  7rw„ 

E[  A,d?,  I  7?]  =  I*  £(A,  1  7:.]<L,„ 

E[  fl  a.dtt  1  TT]  »  f*  E[a,.  \  7T]du, 


for  integrable,  ^-adapted  processes  h,  and  a,,  and  an  7,- 
predictable  process  A,. 

Next  is  the  general  martingale  representation  when  the 
filtration  is  generated  by  both  Poisson  and  Wiener  processes 
(7.  p246l. 

Theorem  2: 

Let  Mi  e  Hi^iyuP),  Si  =  (t(W.,N.,s  <  t),/(P  null 

sets). 

Then  there  exist  1/i-predictable  processes  Ki  and  R,  with 
<  00,  such  that 

Mi  =  Mo  +  J*  K.dJy.  +  R.dq, 
where  ft  =  Nj  —  □ 

We  are  now  in  a  position  to  derive  the  following  optimal 
estimates,  i.e.  the  conditional  expectation  of  Xi  given  the 
observation  <r-fields: 

for  the  filtering  case  Xi  =  n,(X()  =  E[Xi  |  Si] 
for  the  smoothing  case  ni(X,)  =  E[X,  j  SiKs  <  t) 
for  the  prediction  case  n,(X|)  =  E[Xi  |  y,](s  <  t) 
For  the  filtering  problem,  Gertner  [5]  derived  the  normal¬ 
ized  and  unnormalized  equations  directly  from  the  defini¬ 
tion  of  conditional  distribution  and  Bayesian  formulas  under 
the  transformed  measure.  We  take  alternative  approaches 
through  two  steps,  Le.,  the  innovation  approach  with  semi¬ 
martingale  decomposition  method  for  normalized  forms  [1] 
and  then  the  measure  transformation  approach  for  unnor¬ 
malized  forms  (indirect  derivations)  [2j.  For  smoothing  and 
prediction  problems  we  also  mclude  direct  derivations  of  un¬ 
normalized  forms  [3,5j.  All  det^ed  derivations  are  included 
in  the  appendix(  A-1  through  A-9). 

3  Filtering 

Rewriting  (1)  and  (2),  we  have  the  ^i-semimartingales 
for  a  signal  and  an  observation  process 

Xi  =  Xo  +  f„du  +  ludu  +  (jCf|  +  Qi),  (9) 

Ih  =  Wi  ~  h»do  +  J  A«du  +  (W|  +  5t).  (10) 

The  filtering  problem  is  to  derive 

SCi  =  n.(x.)  =  E[Xi  I  Vi] 

where  y»  =  ff(y<,s  <  t)  =  ff(W„l\f„s  <  t).  Noting  that 
it  —  Ji  }ud»  —  %du  is  a  Vi-innovation  martingale,  it  can 


be  represented  as  the  stochastic  integral  with  respect  to  the 
innovation  process  which  has  two  components  ( for  a  unique 

representation  refer  to  [7.  p284]) 

i/f  =  W,  -  jf  h,dtt. 

(11) 

ltt  =  Ni-  f  X„dti. 

(12) 

Elliott  [1|  observed  that  for  continuous-type  state  and  ob¬ 
servation 


ElXiV,  I  Si]  =  ViElXi  I  Si]  =  ir.y, 
is  a  ]/i-«emimartingale  and  so  its  decomposition,  as  the  sum 
of  a  martingale  and  a  bounded  predictable  process,  is  unique. 
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Thea  he  obtained  two  representations  for  XtUt  ,  compared 
bounded  variations  parts,  and  finally  got  filtering  formulas. 
Applying  this  idea  to  the  general  stochastic  equations  (9) 
and  (10),  the  resulting  normalized  filtering  formula  is  of  the 
form(see  A-1) 

n,(Ar,)  =  no(;!fo)  +  +  ii«(7-)Idu 

Jo 

+  [nu(A’uhu)  —  n«(A^u)nu(/i„)  +  n„(a„)]di/y+ 

[nu-(Au)]  *  [IIu_  (AfaAu)  — ITu- (Auj  +  ITu- (iuA«)]d^u 

(13) 

where  n(_(A,)  heuristically  implies  the  predictable  version 
of  that  conditional  expectation.  This  approach  is  a  little 
different  from  ones  taken  in  [7,8|  and  will  be  used  in  later 
developments. 

This  equation  can  be  simplified  by  introducing  the  ref¬ 
erence  probability  measure  Pq  under  which  the  observation 
process  become  simple.  The  resulting  equation  is  the  linear 
unnormalized  conditional  expectation.  From  (4)  and  the 
Bayesian  rule,  we  have 

where  at(X)  is  the  unnormalized  conditional  expectation, 
Eo  is  the  expectation  with  respect  to  Pa  ,  and  <rj(l)(=  A,) 
satisfies  (see  A-2) 


ff«(l)  ~  ^  ^  9u-(Au  —  l)df«.  (14) 

Applying  the  product  rule  to  ffi{X»)(=  Oj(l)II«(Xi))  gjves 

^i(X()  =  ffo(X()4-  (  0'u(/u +7u)dtt+  [  [<Tu(h»Xu +  aa)]dIVu 
Jo  Jo 

+  f  <r«_[(A„  —  l)X,  +  6«Avjd9u.  (15) 

Jo 

An  alternative  approach  obtains  (15)  directly  from  the  def¬ 
inition  fft(Xt)  =  £o[A|X(  I  ^(|  by  representing  A|X(  by  a 
stochastic  integral  and  taking  conditioning|5|. 

Remark:  In  [5|,  Gertner  first  obtained  <Ti{X)  and  then 
derived  ni(X)  by  the  product  rule.  Thus  he  could  avoid  the 
proof  of  the  existence  of  martingale  representation  theorem. 


4  Smoothing 

The  smoothing  problem  can  be  solved  in  a  manner  simt* 
Qar  to  the  filtering  problem.  We  consider  the  conditional 
expectation,  n,(X,)  =  £(X,  |  1/,]  .where  0  <  s  <  t  <  T. 
Notice  that  for  fixed  s,  n((X,)  is  an  -martingale  and  so 
it  has  a  martingale  representation  with  respect  to  the  inno¬ 
vation  martingale  such  that 

!!«(♦,)  =  n,(*,)  ^  K,^^dD„  +  ^  (16) 

where  K,,a,R,,u  are  -predictable  processes  to  be  deter¬ 
mined.  The  same  procedure  as  for  filtering  results  in  a  re¬ 
cursive  smoothing  equation(see  A-4) 

n,(x.)  =  n.(x.)  +  ^‘[n„(x.h„)  -  n.(x.)n«(h,)idz/„ 


-i-_^‘[n,_(A„)]-‘[n«_(x.A.)  -  ii,-(x.)n.-(A.)!d#*,.  (it) 

For  the  unnormalized  form,  by  the  Bayesian  formula 
TT/v\_E'fv  11/1—  ^o(X,A(  I  l/(|  _  a,,t{X,)  _  <r,,i(X,) 

n.(x.)  -  E[x.  I  y,]  -  -EijATM'  "  ~ 


is  obtained.  Again,  applying  the  product  rule  to  o,^(X.), 
we  obtain  the  unnormalized  form  for  smoothing  (see  A-5): 


«T.^{X.)  =  o..,(X.)+^*<r...(X.A,.)dW^«+^‘ff.,«-((A„-l)X,)dq„. 

(18) 

Notice  that  this  is  a  linear  equation  for  Alternatively, 
(18)  can  be  derived  from  the  definition  <r,j{X,)  =  Ea{X,At  | 

^t]  by  representing  X,A|  by  stochastic  integrals  and  taking 
conditional  expectations  under  Eo  (see  A-6). 


5  Prediction 


The  same  procedure  can  be  applied  to  derive  the  condi¬ 
tional  expectation  of  the  form 


n.(xo  =  E[X,  I  y.\  (19) 


where  0  <  s  <  t  <  T.  For  fixed  t,  11,  (X<)  is  a  y,  -martingale 
,  so  it  has  a  representation  of  the  form 

n.(X.)  =  IIo(X,)  +  r  +  /'  R^d^^  (20) 

Jo  Jo 

where  Ku4,Ru,t  are  integrable  y,  -predictable  processes.  For 
convenience  of  computation  of  the  gains  Kua,Rti,tt  an  aux¬ 
iliary  process  Z,  =  E\Xt  \  ^l,0<s<t<r,  is  introduced, 
which  produces  n,(Xt)  =  E[Z,  |  y,\.  Computing  the  gains 
,  we  can  show  (see  A-7)  that 


n.(xo  =  iio(x.)+  /'[ii«(/»«xo-n.(x.)ii.(h,)+n.(a,)]<i/. 

Jo 

+ fill,.  (A.)]-*  [n._  (A,x.)  -n,-  (x,)n.-  (A.)+n.-  (f«A.)Id#i«. 

(21) 

Using  the  Bayesian  formula  to  get  a  simpler  equation  yields 


n.{x,)  =  E[Xt  I  y.i  = 


g,[X,A.  I  y.l  _  o.,,(X.)  _  o...(X,) 
^o[A.  |1/,1  a...(l)  0.(1)  ’ 


or 


IT  /v^  _  B-r-r  11/1  —  I  y.l  _  Of.,(Z,)  0«.,(X,) 

n.(x.)  -  E[z,\y.\  - 

(22) 

By  a  similar  technique  (see  A-8) , 

0»,,(X,)  =  0|,o(X|)  f  [0|,y(A«X,)  ff»,y(Oy)]dWy 

^  lo  ~  (23) 

is  obtained,  where  <  Z,W  >,=  /,* a,du,  [X,N],  = 

A  direct  derivation  of  (22)  is  also  possible  from  the  definition 
0(,,(Xt)  =  Eo[Z,Ji,  I  y,\  using  the  same  approach  (see  A-9). 

6  Discussion 

Starting  with  the  general  models  of  stochastic  systems 
(  mixed-type  of  states  and  observations),  we  derived  the 
normalized  equations  for  filtering,  smoothing,  and  predic¬ 
tion  of  general  stochastic  equations  usinc  semimartinvale 
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decomposition  techniques.  We  also  showed  that  the  corre¬ 
sponding  unnonnalized  equations  could  lie  derived  in  two 
different  ways(indirect  and  direct  derivation).  Notice  that 
the  same  derivation  methods  result  in  similar  structure  in 
forms.  Thus  the  optimal  estimation  problems  are  unified  in 
these  frameworks. 

7  Appendix 

A-l:  the  derivation  of  equation  (13) 

Conditioning  (9)  with  respect  to  [7l, 

n«(X’j)  =  no(Xi))  +  f  n«(/u + '7i,)du  +  m*  {24) 

JO 

where  m<  is  a  -martingale  and  thus  it  has  the  unique 
representation  vith  respect  to  innovation  processes  vt  and 

m<  =  KudUn  + 
for  {/(-predictable  processes 

Remarks:  the  existence  of  irit  can  be  easily  shown.  And 
in  equation  (24),  the  optional  or  predictable  projections  onto 
observations,  denoted  by  n((X()  and  II(-(X|)  respectively, 
should  be  taken.  However,  it  can  be  shown  that  the  follow¬ 
ing  equality  holds  except  on  the  set  of  didP{u)  measure  zero: 

^  n«(/„  -l-Tfu)dtt  =  ^  n«_(/„  +'iu)du 

Thus  (24)  can  be  identified  under  either  projection.  For 
details,  r^er  to  [7,  p2S3].  In  what  follows,  conditioning  on 
J/t  implies  the  optional  projection  or  predictable  one  as  the 
case  may  be.  By  using  Theorems  1  and  2,  this  martingale 
representation  theorem  with  respect  to  innovation  processes 
can  be  proved.  Although  i/(  and  Ht  are  martingales,  we 
can  not  derive  (24)  by  a  direct  application  <rf  Theorem  2 
because  in  general,  D  <  t).  In  [7],  one  way 

around  this  difficulty  is  to  take  a  measure  transformation 
and  under  the  new  measure  to  get  the  representation  with 
respect  to  observations  and  then  innovations.  For  details 
refer  to  (7,p264]. 

To  find  Ku,  applying  the  product  rule  to  XtWt  , 

XtW,  «  W,(f^du+dM^+dB^]+l* X,(h^dtt+dW,}+J^ a^du. 

Conditioning  on  yt  and  using  the  Fubini-type  theorem, 

E[x,w,  1  y,\  =  n.(x,)w,  =  /'*ii,(x’.h,)dtt  +  /’‘n.(a«)du 

Jo  Jo 

+{yt  —  martingale). 

This  equation  shows  that  TIi(X()yi  is  a  special  semimartin¬ 
gale  which  is  the  sum  of  a  yi-  martingale  and  a  predictable 
bounded  variation  process,  and  furthermore  the  decomposi¬ 
tion  is  unique  [l|.  Whereas,  from  (11)  and  (24) 

TKXtjW,  =  f*  Il,(X^)<ItV^+ f*  W^dIl,(X„)-^r  f  d  <  n(X),W  > 
Jo  Jo  Jo 

Noting  that  d  <  Il{X),W  >(=:  Ktdt, 

TLt{Xt)W{  =  na(Ar«)n,(h«)dtt-(-j^  K„daJr{yt~martingale). 

Since  the  decomposition  is  unique,  comparing  the  bounded 
variation  terms,  we  have 

K„  =  ru(x.A.)  -  n.(x,)ru(h,)  +  n«(o.). 


Similarly,  for  R,,  applying  the  product  rule  to  XtNt, 

X,N,  =  f^  Xu-{Kdu+dg,)+l* N^[Udu+dM,^-dH^]+J\x,.du. 

Conditioning  on  i/t  and  using  the  Fubini-typa  theorem^ 
n<(.X|)iVi  =  f  n^(A^^u)du  -h  f  +•  '7K)dtt 

JO  70 

^  lo  iy*  ~  'n^^ftingale). 

On  the  other  hand,  from  (12)  and  (24) 

n((X()jvi  =  ^  ii..{Xu)dN„+l* N,.dru.[x^)+J^ 'i|n(X),Ar]^. 

Observing  that  d[n(X),B),  *  R,dW„ 

n.(x)/y,  =  /JiL~(x^)n^(x^)du 

+  ^  R,n«_(A«)du-t-^  •^un«(/a-|--y„)du-l-(y(— martinjale). 

The  uniqueness  of  decomposition  yields 

R«  =  in.-(>«)]-‘{n,_(A„x„)-ii,_(A.)iu.(x.)-(-ii,._(i„A„). 

Finally,  substituting  and  R*  into  m«  results  in  (13). 

A-2:  the  derivation  of  0/(l) 

Taking  conditional  expectations  of  A(  under  Po  and  applying 
the  Fubini-type  theorem, 

I  :'(i  =  l+f^Eo[KK  I  j/vldW.-bjT ^[A«-(A.-1)  I  l/„ldq„. 
The  result  follows. 

A-S:  the  derivation  of  <rt(Xt) 

From  (14)  and  (24), 

ff,(X,)  =  n,(X,)A« 

*  no(Ao)  +  nM(X’,)(II«(h,)A«dW ,  +  (n^  (k,)  —  l)  Aady^J 

lo 

+  1^  R,n.(h,)A,da  +  -  l)]i,R,dJV,. 

With  straightforward  calculations," we  get  the  equation  (15). 

A-4:  the  derivation  of  equation  (17) 

Taking  conditional  expectations  of  X.Wt  with  respect  to  y^ 

n((.X’,)W(  =  J  II«(X<h,)du  -b  (y*  —  martingale). 

However,  from  (11)  and  (16) 

II((X’,)W|  =  II„(X,)Il,(h«)du-+- J*  K,,ndu+{yt— martingale). 

By  the  unique  decomposition  of  a  special  semimartingale, 

Jf,,«  =  II,(X,h,)  -  II«(X.)II,(h,). 

Similarly,  conditioning  X,N,  on  l/i, 

n«(A’,)N#  =  J  II«_(X’,A«)  +  [yt  —  martingale). 

However,  from  (12)  and  (16) 
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n,(X.)Nt  =  U.(X.)N.  + 1*  TU..{X.)Tl..{X^)du 

+  i2,.un,_  (Au)<fu  +  (l/t  —  martinffo/e). 

Equating  the  bounded  variation  terms  of  two  representa¬ 
tions  of  n»(^,)fv;, 

Ji..u  =  [n«.(A.)i-‘[ii..(x.A«)  -  n._{^.)n«-(A.)l. 

Then  the  result  directly  followa, 

A-5:  the  derivation  of  equation  (18) 

From  (14)  and  (16), 

cM)  =  ii.(a:.)a, 

=  Ti.{x.)i.+I*  n._  {x.)dl,.+J*  i^<ni.(x.)+J*  d(n(Ar,),  a1„. 

Noticing  that 

d[n(Ar,),A]u  —  n«(/t«)A«i<C,,«du  +  Au(IIv(Au)  —  l)iZ,_y<iUV*, 
With  some  manipulations, 

<r.Ax.)  =  n.(;f,)A.  +  Ayii«(Ar./i,)dw« 

-bj['Ay.[(ny((A.-l)Afy)]d,.. 

The  result  immediately  follows. 

A-6:  the  direct  derivation  of  (18) 

X,A4  =  X,K,+f*X,dA^  =  X.X,+J^  Ar,[AyAydWy+Ay(Ay-l)d9, 
Therefore 

«r..i(A:.)=  WA.11/,] 

=  Eo[Af,A,  I  y,]  +  J*  Eb(AyA,X.  1  l/.]dWy 

+  J*Eo[XM^-i)\yu]dq,. 

This  completes  the  proof. 

A-7:  the  derivation  of  equation  (21) 

Applying  the  product  rule  to  Z,W,,  and  conditioning  on  y, 
produce 

n.(X,)lV,  =  r  IIy(2yhy)do+  Hy  (o.)  (lu +(!/,- mortinpal  c) . 

From  (11)  and  (20), 

n.{x,)w,=  /V.dn,(z,)-i-  rn,(2.)dw.+  [’ K^du 

Jo  Jo  Jo 

=  f  IIy(^H)ny(h«)dtt  +  f  Kujdu  +  {y,  -  martingalt). 

Jo  Jo 

Comparing  the  predictable  processes, 

Ku4  =  ril,(2yhy)du-  n^{ZATU.(K,)du+  riU(a,)dtt. 

Jo  Jo  Jo 

Similarly,  applying  the  product  rule  to  Z,N,,  and  projecting 
this  onto  the  observation  o-fields  produce 

E\z.N.  1  y.]  =  n.{x,)N. 

=  ^  J?[Z«_Ay  I  A/yldtt  +  jf  ny(fyA,)dtt  +  (y,- martingale). 
From  (12)  and  (20), 


n.(X.)lV.  =  /*IIy.(Af.)dlV„+  r  Nu.(ni.{X,)+  f‘Ru^dN^ 

Jo  Jo  Jo 

=  Il«_(Ar,)II,(Ay)du+jf  iRy^IIu_(Ay)ciu+(y,-martinpale). 

The  unique  decomposition  theorem  gives 

=  (n«.(Ay)]-‘[iiy_(A.Zy)-n,-(Af,)n.-(A.)+ny-(f.A.)]. 

Now, 

II.(A..?y)  =  r[Ay£[X,  I  /„]  I  1/y]  =  r[A«Af,  I  1/y]  =  II.(A«Af,). 
Therefore,  from  Ku4  and  Ru^,  equation  (21)  follows. 

A-8:  the  derivation  of  equation  (23) 

From  (14)  and  (20), 

OtAXt)  =  n.(A:,)A.  = 

Ilo(Af,)+  r  Ay.  [Ku4d:'u+Ru4dfiu]+  f  Hy.  (Af|)dAy+  [‘  d(n(Ar,),  A].. 
Jo  Jo  Jo 

Observing  that 

d[lI(Art,A]y  =  AyIIy(liy)iiry^idu  +  Ay— (ny(Ay  —  i)R^dN^f 

vnth  some  computations,  we  get  the  equation  (23). 

A-9:  the  direct  derivation  of  equation  (23) 

From  (22)  and  the  product  rule, 

^(A«  =  2oAo  4-  f  Z^~dLn-\-  f  Ai,-dZy+  f  d[A,  Z]y. 

Jo  Jo  Jo 

Note  that  under  Po,  [A,  Z]»  =  0.  Applications  of  the  Fubini- 
type  theorem  gives  (23). 
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ABSTRACT 

This  paper  considers  a  singularly  perturbed  hybrid  system  whose 
state  equations  depend  on  a  near  decomposable  finite  state  Markov 
process  with  fast  transitions.  The  limiting  behavior  of  the  fast  and  slow 
mode  subsystem  for  the  duration  of  intervals  of  fast  transitions  within 
each  group  is  investigated.  The  results  are  shown  to  hold  when  the 
process  is  near  decomposable,  ergodie,  time-reversible  and  stationary, 
and  the  values  of  the  system  matrices  within  each  group  commute. 

I.  INTRODUCTION  AND  PROBLEM  FORMULATION 

This  paper  examines  the  limiting  behavior  of  a  singularly  perttubed 
hybrid  system  with  switched  parameters  which  depend  upon  a  near 
decomposable  finite  state  Markov  process  (FSMP).  The  state  space  of  a 
hybrid  system  is  a  cross-product  space  of  a  Euclidean  space  and  a  finite- 
state  space.  Basically,  hybrid  systems  are  linear,  piece-srise  constant, 
time-varying  systems,  which  are  ssritching  among  a  finite  number  of 
constant  realizations.  Hybrid  systems  have  already  been  considered, 
and  their  properties  are  well  documented[l].  The  use  of  the  aggregated 
models  to  describe  global  features  of  singularly  perturbed  FSMPs  is 
studied  in  [2j.  Preliminary  investigation  of  singularly  perturbed  hybrid 
systems  and  singularly  perturbed  FSMPs  is  reported  in  [3],  Here  we 
extend  the  latter  results,  but  consider  the  decoupled  case  where  the 
switching  is  a  near  decomposable  FSMP  as  discussed  in  [4], 

The  system  models  under  consideration  are  assumed  to  have  the 
following  state  equations: 

i(t)  =  A,(r(01*(<)  (1) 

nHt)  =  A,(K<)Is«)  (2) 

where  >i  >  0  is  a  small  parameter,  c(()  €  R"  represents  the  slow  mode 
of  the  system  and  r(()  €  R*  the  fast  mode,  the  process  r(t)  is  the 
fijrm  index  which  takes  values  in  S  =  {1,2,  and  determines  the 

system  model  at  a  particular  time.  The  process  r(t)  is  modeled  as  an 
FSMP  which  contains  N  groups  of  strongly  interacting  states,  where 
group  j  consists  of  ny  fast  states  and  Z;wi  "1  =  Matrices  Ai(r(t)] 
and  A]{r{()]  are  random  through  their  dependence  on  the  values  of  the 
process  r(().  The  current  values  of  Ai[r{()I  *>  denoted  by  an  index, 
for  example,  for  i  =  1, 2,  A‘j„  will  denote  /L(r(<)]  when  r(t)  s  m  € 

group  j,  ;  s  1 . ,V,  m  =  1 . ny.  Let  the  evolution  of  the  process 

r(<)  satisfy 

^=PW{iF  +  G)  (3) 

where  P(t)  is  an  M  x  Af  transition  probability  matrix  at  tima  t.  It  is 
assumed  that  0  <  r  <  1,  matrices  F,  G  and  +  G  m  generators 
such  that  the  process  tit)  has  a  single  ergodie  class.  Furthermore,  let 
each  of  the  N  groups  be  a  FSMP  with  a  single  ergodie  time-reversible 
ciass(5].  The  generator  of  the  jth  group  Fj  is  the  jth  block  in  the  block- 
diagonal  matrix  F.  Thu  paper  considers  the  stationary  case,  namely 
that  the  process  rft)  has  reached  its  steady  state. 

The  following  is  the  outline  of  the  paper.  Section  11  summarizes 
asymptotic  behavior  of  a  near  decomposable  FSMP.  In  Section  HI, 
the  approximate  model  iy(()  for  the  slow  mode  subsystem  Zj(()  for 
the  duration  of  intervals  of  transitions  among  the  fast  states  of  group 
j  is  derived.  The  probabilistic  averaging  procedure  is  adopted.  The 
mean-squared  error  between  iy(t)  and  Zj(()  is  studied.  In  Section  fV, 
we  study  the  lirmting  behavior  of  the  fast  mode  subsystem  when  both 
U  and  <  tend  to  0.  Section  V  considers  an  example  to  illustrate  the 
methods  used  in  Section  III  and  IV.  Section  VI  concludes  the  paper. 

^  THm  rwsirh  !■  lupponed  bf  the  U.  S.  Air  Force  under  srsne  AFOSR-aS-OZSl. 


n.  ASYMPTOTIC  BEHAVIOR  OF  A  NEAR  DECOMPOSABLE 
FSMP 

In  this  section  we  summarize  the  asymptotic  behavior  of  a  near  de¬ 
composable  FSMP.  Since  the  process  r{t)  satisfies  the  MSST  {mnlttple 
semuUitlify)condition[2],  it  hss  only  two  time  scales.  In  order  to  con¬ 
struct  an  asymptotic  approximation  of  the  process  r(t),  one  establishes 
the  following  proposition: 

Tiomma  1  Let  =  limi_oa  explPI).  Then 

n=dtag(j]j . nAf] 

WilA  rij  ®  =  for  eome  rij-Jimettsional  rowvector 

Ej  nek  that  Ej  =  [eyi . sad  Ej  lj  =  1. 

Furthermore,  define  the  MxN  matnz  V  and  the  N  x  M  matrix  U 
as  fotlova: 

V  =  diag{li,...,ln],U  =  diag[Ei . En\.  (4) 

then 

VU  =  Yl'UV  =  I 

where  I  it  an  S  x  N  identitg  matrix. 

We  now  use  fJ,  V  and  U  to  construct  a  uniform  asymptotic  approxi¬ 
mation  of  the  process  r(t)  as  shown  in  the  following  lemma: 

Lemnui  2  Attume  that  0  <  r  <  1,  then 

P(«)=sexj)(P(l-to)/«)+ 

Vexv(UCV{t~to)U -11+0(1)  (5) 

sat/srmfy  vsltd  for  t  >  (q. 

Obviously,  when  t  -•  0,  the  process  r(()  can  be  replaced  by  an 
aggregat^  process  f(t)  taking  values  in  S  s  {1,...,N}.  Let  P(t)  be 
the  transition  probability  matrix  of  the  process  r(().  Then 


P{t)  tx  exp(U GV(t  - !«)),  for  t>  to  (8) 

and  UGV  is  the  generator  of  the  process  r(t)  and 

P(f )  s  VP{t)U,  for  t>  to  (7) 

Note  that  the  process  f(()  is  stochsstically  continuous  and  has  a  single 
ergodie  eiass  such  that 

,||m  R(<)  =  I  (8) 

Thus  Equation  (7)  can  be  interpreted  as  follows: 

Pr{r(t)  =  m  6  group  ;|r((o)  =  k  €  group  «} 

*ey„  Pr(r(t)=>|f(to)  =  »]-bo(l).  (9) 

for  t  >  to  and 

^lim  Pr(r(t)  =  m  C  group  j 

\r(to)  =  k  e  group  i]  =:  ej  ejm  (10) 


where  eym  is  the  component  of  the  ergodie  probability  vector  ej  corre¬ 
sponding  to  state  m. 

Furthermore,  if  we  express  (S)  in  the  fast  time  scale  r  =  (f  -  to)/e, 
we  obtain 

Lemma  3  Let  r  s  (1  -  (o)/r.  We  have 


P(r)  =  P((r)  =  erjKfr)  +  0(f)-  dD 

Smtlarif,  the  cfmatioa  for  the  Iraajiiio*  proiaittUf  matnz  of  r(r)  ke- 
eomu 

=P(r)(/-  +  fG)  (12) 

aT 

From  (11)  and  (12)  it  foUowa  that  the  influence  of  weak  interaction  tG 
in(12)  will  become  signiflcant  after  a  long  period  of  time  r.  Aaaume 
that  r(0)  €  group  j,  then  we  have 

Pr{r(r)  =  n  6  jroup  j|r(0)  =  m  €  group  i} 

=  pLn+0(t)  (13) 

where  ia  the  (m,  n)th  element  of  ezp{Fjr). 

In  this  work,  the  iteady-etate  diatribution  of  the  ptoeeaa  r(()  will 
be  needed.  Thia  probability,  ftom  (10),  ia  defined  by  eje/m,  j  — 

1 . N,m  =  1 . ny.  Henceforth,  it  ia  aaaumed  that  the  proceaa 

r(()  haa  reached  ita  ateady  atate,  in., 

Pr{r(«)  =  m  €  group  i\  =  tjtjm  (1<) 

With  the  above  aaaumptiona,  we  conclude  that  in  the  faat  time  acaie 
r,  0  <  (  <  1,  jumpa  among  diflerent  groupa  occur  after  a  very  long 
period  of  time  r  ao  that  the  following  reaulta  are  obtained: 

Lemma  4  Since  group  j  ia  a  fiuita  trgodie  alaiiouarg  time  reueraUle 
Markov  chain,  then  the  expectation  of  A\r{T)\  ia  group  j  ia  given  hg 

a}  H  r{Ai[r(r)]|r(r)  €  group  j] 

=  i  ,  (IS) 

m«l 

and  iti  autoeovenance  function 

e;(»-)  =  i^/liWr)].4i(r(r  +  a)llr(r) 
and  tir  +  a)  e  group  j]  ~  {A})^ 

=  H  -  ejn)Aj„  (18) 

flMl  fial 

We  next  conaider  the  additive  proceaa 

V}(r)  ss  jT  Ailr{a)  €  group  j]da.  (17) 

For  large  (,  we  would  expect  that  the  proceaa  Yj(r)  ia  aaymptotically 
normal  in  diatribution,  and  aatiafies  a  central  limit  theorem.  Before 
doing  so,  the  next  lemma  ia  required: 

Lemma  S  Cet  c’g,  Fj  the  fundamental  matrix  of  group  j  and  Cj  he 
defined  aa  foUowa 

<0  =  diajfcji . tjn,],Fj  =  (/’iJ 


Cj  =  /  ej  ( r)dr  =  . . J  x 

[eDF9Una][{A],f . {A]^fV  (18) 

where  9  denotea  the  Krvneetor  produet[6j. 

Then  C,  la  atncilg  poaiine  definite  unleaa  Aj„  ia  independent  of  m 
and  Cj  la  zero  if  Aj„  ta  independent  of  m. 

The  poaitivity  of  C,  haa  a  natural  meaning  in  that  it  corresponda  to 
the  variance  of  y)(r)  in  the  central  limit  theorem. 

Lemma  6  LetY,{T)  =  /'  .4i[r(#)  6  group  ]\da.  Then  Yj{r)  ta  aagnp- 
toiieallf  normal  m  diatnhntion  for  large  r  and  aatiafiea  the  central  limtt 
theorem,  t.e., 

Yj(r)  —  iV(Ajr,2C,r),  aa  r  —  ao.  (19) 


Meanwhile,  Yj(r)  alao  aatiafiea  a  week  law  of  large  number  such  that 


Y,(r)  a 


(20) 


For  proof  of  the  above  Lemmaa  one  can  see  aee  [2],  [p.  118,  [5]]  and  [7], 
The  central  limit  theorem  ia  uaeful  in  that  we  exploit  it  to  derive 
the  approximate  model  of  the  slow  mode  subsystem. 


III.  SLOW  MODE  SUBSYSTEM 


This  section  considers  the  limiting  behavior  of  the  alow  mode  sub¬ 
system.  Let  Zj{t)  denote  the  atate  of  the  subsystem  when  r(t)  takes 
values  in  group  j.  The  trajectory  of  Zj{t)  is  baaed  on  writing  the  solu¬ 
tion  of  x(t)  aa  a  function  of  the  fast  states  in  group  j  for  the  duration 
of  the  intervals  of  transitions  among  the  faat  states  of  group  j.  The 
solution  Zj(t)  can  be  considered  aa  a  standard  state  equation  solution 
of  a  time  varying  linear  system.  The  time  varying  nature  stems  Born 
the  dependence  of  the  system  matrices  on  the  different  values  of  the 
fast  states  of  group  j.  In  what  follows  we  use  probabilistic  averaging  to 
derive  the  approximate  model  for  Xj{t).  Given  the  group  of  the  pro¬ 
cess  r(f )  (i.e.,  group  J  )  and  the  state  of  the  systems  (1)  at  time  t  (i.e., 
Xj(l)),  the  expected  atate  at  time  ( -f  A  where  A  is  of  order  e  ia  to  be 
computed.  Thus 

Xj(t  +  A)S  E{xj(t  +  A)lx,(«)  6  group  j] 

■i 

=  ^  t*J>(Al^A){ej„ej/ei)xj(t) 

mml 

=  ^  e^A}„A)ej„Xj(t)  (21) 

From  Equation  (21)  one  deduces  the  dynamics  of  Z;(f )  as- 

ij(0 

=  Um  Et*i(*  +  ft)  -  *j(0l’‘(0  €  group  j]/ft 

®  A>m*im)5;(l) 

ntttl 

=  Ajii(t) 

where  A}  ia  the  statistical  average  value  of  v4t  [r(t)]  when  r(t)  switches 
among  the  fast  states  of  group  j.  In  what  follows  we  shall  show  that  the 
mean-squared  error  between  ij(t)  and  Z;(t)  tends  to  sero  as  e  tends  to 
zero. 

Tbrnezrenn  1  Suppoae  that  A}„A}„  =  Aj^A}^,  m,n  e  {l..-,n,]  and 
Aj  ia  atahle.  Then 

Um  EOI*j(0-*;(OII’ 


M*)  6  group  ;,  a  €  [to,  t]]  —  0  (22) 

where  ||  •  ||  deselea  the  Euclidean  norm. 

Proof.  Assume  that  the  process  r(t)  switches  to  group  j  at  time  tg.  We 
express  the  systems  (1)  in  the  fast  time  acaie  r  =  (t  —  to)/e  to  obtain 

Xi(r)  =  eAMT)]Xi(r).  X,(0)  =  Zy(fo)  (23) 

With  the  assumption  that  =  A]„Aj„,  we  have 

Xj{r)  =  t’fo  ''•HOSir-r  '’^s,(lo) 

On  the  other  band,  we  express  Z;(t)  in  the  fast  time  scale  as 

.Y,(r)  =  e^’.‘"z>(lo)  (24) 

To  prove  the  theorem,  there  are  two  cases  to  consider: 

Caae  I:  Let  depends  on  the  value  of  m.  The  difference  between 
X,[r)  and  .Y,(r)  is  given  by 
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Xj(r)  -  x,(r)  =  ItWrt-A'r,  _  I]ei!>-I.(ta)  (25) 

For  a  given  t,  r  —  oo  aj  <  —  9.  It  followe  that 

£Q|e.(r,(n-i}r)  _  i||i] 

=  trace  £^e=‘(>'<(')-*!r)  _.2e‘<»''<’’'-'‘!’’>  -  1] 

fia««<n>0 

=  0(<) 

and  ||e^'‘’'ll  <  0.  Hence, 

£tll-Xy(r)  -  ,Xi(r)ll’lr<i)  €  srronp  j.s€  l«a.<n 

=  0(t)  (2«) 

Taking  the  limit  <  —  0,  we  obierve  that  the  right-hand  side  of  Equation 
(26)  tends  to  zero.  A  change  of  variable  yields  the  result. 

Case  //.■  Let  Aj„  be  independent  of  m  and  =  Aj.m  = 

Lemma  5  then  yields 

a}  =  a}  and  Cj  =  0 

Obviously,  Xj(r)  =  Xj(r),  i.e.,  there  is  no  error  between  Xj(r)  and 

Xj(r). 

The  resulting  approximation  implies  that  the  slow  mode  subsystem 
can  be  approximated  by  a  hybrid  system  depending  on  the  aggregated 
Markov  chain  f(t)  taking  values  in  S.  The  system  model  of  the  hybrid 
system  at  a  particular  time  t  it  a  statistical  average  of  the  system 
matrices  over  their  values  based  on  the  group  that  the  process  r(() 
takes. 

rv.  FAST  MODE  SUBSYSTEM 


In  this  section,  we  study  the  limiting  behavior  of  the  fast  mode 
subsystem: 

^i(t)  a  A,(r(t)]z(t)  (27) 

Assume  that  ||A^„||  a  0(1),  j  a  l,.,.,.y,  mat,  ...,n/  and  that 
r(()  se  m€  group  j.  To  analyse  the  state  z(<),  the  stretched  time  scale 
t,  where  f  a((  —  is  used.  Hence,  expressing  the  systems  (27)  in 
the  stretched  time  scale  t  yields 

Z(«)  a  A,[r(t)lZW.  Z(0)  a  ,(«,).  (28) 

Similariy,  the  equation  for  the  transition  matrix  of  r(9)  becomes 

^  a  P(«)(i  f  O),  P(0)  a  I.  (29) 

The  limiting  behavior  of  the  systems  (28)  depends  on  the  rslstivs  sim 
of  ft  and  <  as  they  both  tend  to  zero.  There  are  three  cases  to  consider. 
Csss  A  /i  a  g(e)  and  iim,..4  * 

In  this  ease  we  have 

a  0(t),  P(0)  a /.  (30) 

Thus,  in  the  stretched  time-scale  9,  the  transitioo  probability  matrix 
tends  to  a  coutant.  Since  P[9)  is  an  identity  matrix,  sre  observe 
that  transitions  among  states  of  the  process  r(8)  are  very  slow,  ia.. 
r(0)  a  m  ggroup  j  implies  r(4)  a  m  €  group  j  for  all  9  when  <  — •  0'^. 
Under  this  condition  the  system  matrix  for  t^  state  variables  r(l)  is 
A^^  when  r(0)  a  m  g  group  j.  Similar  to  Section  11,  the  solution  of 
Zj(t)  of  (28)  is 

Z,(9)  s  exp(Aj^9)i{ta)  (31) 

which  when  transformed  to  the  normal  time  scale  t  yields 

ijit)  a  exp(Aj„it  -  to)/ft)ii{to),  for  t  >  to.  (32) 

If  all  the  values  of  are  stable,  then  Zjit)  can  be  approximated  by 
(32).  The  following  theorem  summarizes  the  case. 


Theorem  2  Assamc  that  ft  =  o(()  and  Iim,_op/(  —  0.  all  the  valnet 
afAj„  art  stable,  and  r(lo)  a  m  €  jrosp  ;.  Then  at  i  —  O 

!j(t)  a  ezp(A*„(l  -  lo)/M)rj(lo),  for  t  >  t,.  (33) 

Cass  If.  ft  a  0(e)  and  lim,— o>i/<  —  i  where  i  a  0(1). 

In  this  case  we  hare 

a  P{9)(kF  +  0(e)).  P(0)  a  I.  (34) 

From  (34)  it  follows  that  ,  in  the  stretched  time-scale  8  as  <  —  0, 
no  jumps  occur  among  groups  and  r(8)  suya  within  group  j  when 
r(0)  a  m  €  group  j.  While  r<l)  takes  values  among  the  fast  states 
of  group  j,  then  :j(<)  behaves  approximately  as  any  time  invariant 
systems  with  constant  system  matrix  held  to  their  values  at  the  last 
transition.  The  following  theorem  addresses  the  problem  for  the  case. 

Thaoewm  3  Attnme  that  ft  a  0(e)  and  Iim,_m  ft/e  —  k  where  k  a 
0(1).  Denote 

lf»«l 

Then 

1.  In  the  ttretehed  time  teale  9,  tf(9)  it  arpnxmetelt  modeled  at 
ea  aalenemosu  Ayknd  system  (t9)  dependent  on  the  fast 
states,  of  /mp  j,  whose  tmnsiiion  fnbabilUp  metnx  is  given  bp 

P'(9)  a  exp(Fjk9) 

t  If  the  values  af  A^  art  stable,  then  the  solution  of  Zj(t)  is  aai- 
formlp  aspmptoticullp  stable. 

Note  that  logarithmie  aorm[l]  can  be  used  to  check  the  stability  con¬ 
dition  of  Aj. 

Cate  III.  ft  a  fOiohere  0  <  a  <  land  lim,  .„  ft/e go. 

Siinilariy,  we  have 

^  ^  **(0)  “  (35) 

which  implies  that  transitions  among  the  Markov  states  within  groups 
are  very  Cmt  and  that  transitions  among  gtoups  are  very  alow  as  e  -•  0. 
Thus,  r(9)  only  takes  values  of  group  j  if  r(0)  a  m  €  group)  and  2)(8) 
can  be  approximated  as  an  averaged  system  with  a  constant  aystem 
matrix. 

Tbsorezn  4  Suppose  that  ft  me"  where  0  <  a  <  1  sad  liiti,_a  ft/e  — 
00  sad  r(to)  yreep  Assume  that  all  values  of  A^  commute  with 
each  other  and  are  stable.  Then 

I.  Zf(9)  can  be  approximated  at  sa  sverafed  system  whose  dyaam- 
iest  eyastseas  sre  described  bp 

ii(9)  =  A]2i(9).  2i(0)  =  r,(t.)  (38) 

8.  The  mcaa-sfsared  error  between  Z/(9)  and  2/(9)  tends  to  sera 
as  t  tends  to  ten. 

S.  The  solution  of  Zf  (9)  it  unifomlp  aspmptotiettlp  stable. 

V.  AN  EXAMPLE 

An  example  to  illustrate  the  method  in  Section  II  and  III  is  demon¬ 
strated  here.  The  system  is  given  by  (l)-(2)  where 


A},  3 

2 

-4 

'  ^ 

-3 

1 

-.1 

diJl  = 

[V 

1 

-5 

.  Aj,  3  ^ 

-6 

2 

1 

-4 

A\x  = 

■  -6 
.  ‘-5 

3 

-12 

.  = 

4 

-10 
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-10  2 
4  -6 


and  the  FSMP  r{t),  shown  in  Fig.l.  which  consists  of  two  groups,  each 
of  which  contains  two  strongly  interacting  states,  has  the  following 
generators: 

■  -1  1  0  0  ■ 

_  1  -I  0  0 

0  0  -2  2  ’ 

0  0  3  -3 
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Hence,  from  the  definition  of  Aj  and  .dj,  j=1.2,  we  have 


'  -2.5 

2  1 

ii  r 

-6.4 

1  1 

1 

-4.5  J 

.  A,=  [ 

2 

-4.4  J 

A?  = 

'  -4.4  3.5 

,  1.75  -11.2  , 

.  Ajw 

f  -8.8 

[  3.6 

1.8 

-5.2 

Fig.2  and  Fig.3  show  the  sample  trajectories  of  xi(()  and  zi((), 
when  the  initial  conditions  zi(0)  =  [2.0,3.01^,  under  three  cases:  (i) 
c  =  0.1,  (ii)  e  s  0.01,(iii)  e  =;  0.001.  Obviously,  the  error  betsreen 
zi(0  and  zi(()  is  smaller  as  r  becomes  smaller.  Similarly,  Fig.4  shows 
the  sample  behavior  for  ri(()  and  ii(<],  with  ri(0)  =  [2.0, S-Op*  and 
<  =  0.1.  The  errors  do  become  smaller  when  <  goes  decreases. 

VI.  CONCLUSIONS 

This  note  considered  the  limiting  behavior  of  a  singularly  perturbed 
decoupled  hybrid  system  whose  state  equations  depend  on  a  near- 
decomposable  finite  state  Markov  ptoeeas.  The  limiting  system  be¬ 
havior  of  the  slow  mode  subsystem  for  the  duration  of  intervals  of  fast 
transition  within  each  group  can  be  approximated  by  an  averaged  value 
of  the  system  matrix  over  aU  their  values  baaed  on  the  fmt  states  of 
the  group.  The  limiting  behavior  of  the  fast  mode  subsystem  tor  the 
duration  of  intervals  of  fast  transitions  within  each  group  depends  on 
the  relative  site  of  and  r  when  both  p  and  <  tends  to  xero:  (i)  the 
system  can  be  approximated  as  a  time  invariant  system  with  the  con¬ 
stant  system  matrix  held  to  the  value  at  the  initial  transition  when 
p  s  o(<),  (ii)  the  system  can  be  modeled  as  a  hybrid  system  depending 
only  on  the  fast  states  of  the  group  when  p  =  0(r),  (iii)  the  system 
can  be  approximaced  as  an  averaged  value  of  the  system  matrix  over 
all  their  values  based  on  the  fast  states  of  the  group  when  p  =  <*  where 
0  <  a  <  1.  The  results  need  two  erudal  assumptions,  the  ergodicity 
,  stationary  distribution  and  time  reversibility  of  the  process,  and  the 
fact  that  the  values  of  the  system  matrices  within  each  group  commute. 
The  results  hold  even  if  the  aggregated  process  has  an  absorbing  state. 

Additional  work  it  needed  concerning  the  relaxation  of  the  restric¬ 
tions  of  commutation  of  the  system  matrices  within  each  group.  The 
Baker-Campell-Hausdorff  (BCH)  formula  [1,8)  appears  to  be  a  promia- 
ing  feature  in  this  direction. 

rinally  adapting  the  results  of  the  note  to  the  more  general  singu¬ 
larly  perturbed  (stochastic)  hybrid  systems  where  the  twitching  it  a 
near-decomposable  finite  state  Markov  chain  may  be  usefuL 
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Fig.l  The  Process  r(t) 


Fig.2  The  Sample  Trajectory  of  Xn(f)  and  iii(f) 
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Fig.4  The  Semple  Trajectory  of  ru(0  and  iu(t) 
The  Sample  Trajectory  of  rij(t)  and 
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ABSTRACT 


This  paper  considers  a  singularly  perturbed  hybrid  system  whose 
state  equations  are  governed  by  a  stochastic  switching  process,  which 
is  singularly  perturbed  and  is  modeled  as  a  near  decomposable  con¬ 
tinuous  time  finite  state  Markov  chain  (FSMC).  The  decomposition 
of  the  system  and  the  switching  process  together  into  slow  and  fast 
subsystems  is  investigated.  An  approximate  model  for  the  slow  sub¬ 
system  over  the  interval  of  fast  transitions  within  each  group  is  de¬ 
rived  and  the  mean-squared  error  between  the  model  and  the  actual 
subsystem  is  quantified.  The  stability  of  the  slow  mode  subsystem 
is  studied  and  two  stability  criteria  are  introduced.  The  behaxnor 
of  the  fast  subsystem  depending  on  the  relative  size  of  perturbation 
parameters  is  analyzed.  Finally  an  example  is  used  to  illustrate  the 
aforementioned  techniques. 

1.  INTRODUCTION  AND  PROBLEM  FORMULATION 
1.1  Introduction 

This  paper  studies  the  asymptotic  behavior  of  the  trajectory  of 
a  singularly  perturbed  hybrid  system  whose  state  equations  depend 
on  a  near  decomposable  continuous  time  finite  state  Markov  chain 
(FSMC).  The  state  space  of  a  stochastic  hybrid  system  is  a  cross 
product  of  an  Euclidean  space  and  a  finite  discrete  state  space.  Ba¬ 
sically,  stochastic  hybrid  systems  are  a  special  type  of  linear,  piece- 
wise  constant,  time  varying  systems  which  switch  randomly  among 
a  finite  number  of  linear  time  invariant  models.  The  switching  be¬ 
haves  like  an  FSMC.  Such  systems  have  been  successfully  used  to 
model  pilot  commands  in  target  tracking,  isolation  levels  of  solar 
receivers,  and  systems  subject  to  sudden  changes  in  their  structure 
and  parameters  which  are  caused  by  phenomena,  such  as  comper- 
nent/sensor  failures  or  repairs,  abrupt  environmental  disturbances 
and  changing  syrtem  interconnections  in  manufacturing  systems  and 
large  scale  flexible  stnictures  [1].  This  paper  is  concerned  with  the 
a.symptotic  approximation  of  singularly  perturbed  stochMtic  hybrid 
systems  when  both  the  continuous  states  and  the  switching  process 
are  singularly  perturbed.  The  study  of  systems  of  this  type  is  moti¬ 
vated  by  new  applications,  such  as  analysis  of  singularly  perturbed 
systems  containing  quantized  elements  or  on-olf  control  (2],  and  sim¬ 
plified  filtering  schemes  for  singularly  perturbed  switched  parameter 
systems  (3). 

Singular  perturbation  methods  in  [d-O]  are  used  in  the  paper  to 
decompose  the  continuous  states  and  the  switching  process  together 
into  slow  and  fast  mode  subsystems.  The  methods  alleviate  the  prob¬ 
lems  of  stiffness  difficulties  resulting  from  the  interaction  of  slow 
and  fast  dynamics.  Singularly  perturbed  FSMCs  and  singularly  per¬ 
turbed  stochastic  hybrid  systems  have  been  investigated  by  several 
researchers  in  (3.7-101.  Aggregation  methods  were  used  to  describe 
global  features  of  singularly  perturbed  FSMCs  [7-8).  In  (9)  the  au¬ 
thors  developed  aggregation  and  averaging  ideas  to  deal  with  approx¬ 
imation  of  stochastic  hybrid  systems  in  which  the  switching  process 
depends  on  its  current  discrete  state  and  the  continuous  states.  An¬ 
other  study  l-l)  examined  the  limiting  behavior  of  a  class  of  singularly 
perturbed  stochastic  hybrid  systems  where  the  switching  process  is 
singularly  perturbed  and  independent  of  the  continuous  states.  The 
authors  in  (I0|  have  generalized  the  results  of  (3)  by  allowing  a  much 

broader  class  of  the  switching  process  which  consists  of  many  groups 
of  strongly  inter.acting  discrete  states.  In  [10]  an  approximate  model 
for  the  slow  mode  subsystem  within  each  group  is  derived  base  on  the 
probabilistic  averaging  procedure.  Its  accuracy  is  quantified  with  the 
restriction  of  commutability  of  the  system  matrices  of  all  realizations 
within  each  group. 

Our  aims  here  are  to  apply  the  aggregation  method  to  derive  an 
approximate  model  for  the  slow  mode  subsystem  over  the  interval  of 
fast  transitions  within  each  group  of  the  switching  process,  to  quan¬ 
tify  the  accuracy  of  the  approximate  model  without  the  restriction 
imposed  by  the  proofs  derived  in  [3,10],  and,  finally,  to  analyze  more 
general  singularly  perturberl  stochastic  hybrid  systems. 
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1.2  Problem  Formulation 

The  system  models  under  consideration  are  assumed  to  have  the 
following  state  equations: 

*(0  =  ^iHOWOd- ^Ij['’(<)]^(0ia:(fo)  =  *0,  (1) 

=  >lj,(r(01i(0  +  >lj(r(t)]z(t),  z(fo)  =  rp.  (2) 

where  ^  >  0  is  a  small  parameter,  x(f)  €  W  represents  the  slow  mode 
of  the  system  and  z(()  €  R'  the  fast  mode,  and  the  process  r(t)  is  the 
form  index  (or  plant  mode)  which  takes  values  in  S  =  {1,2,  ....,m} 

'  and  determines  the  system  model  at  a  particular  time.  All  matrices 
are  of  proper  dimensions  and  are  random  through  their  dependence 
on  the  values  of  the  process  r(f). 

'  We  further  assume  that  r(t)  can  be  modeled  as  a  near  decompos¬ 
able  FSMC  which  contains  n  groups  of  strongly  interacting  states, 

I  where  the  ith  group  consists  of  n,-  fast  states  which  form  the  subset 
'  Si  =  {^^i  -I-  j\  }  =  1, ...,  n,},  where  M  =  J3*=t  ^  2  and  A/)  =  0. 

,  Note  that  J2r=i  U?=i  =  S.  The  current  values  of  the 

system  models  are  denoted  by  an  index,  for  example,  will  de¬ 
note  /tt(r(l))  when  r(/)  =  j,j  €  Si,  where  it  =  1,2,12,21.  Let  the 
evolution  of  the  process  r(l)  satisfy 

^  =  P(t)(lj  +  G)  (3) 

where  P{t)  is  an  m  x  m  transition  probability  matrix  at  time  t.  It  is 
assumed  that  0  <  t  <  1,  matrices  F,  G  and  +  G  are  generators 
such  that  the  process  r(t)  is  irreducible.  Furthermore,  let  each  of  the 
n  groups  be  an  irreducible  and  time  reversible  FSMC  with  generator 
Fi.  The  generator  Fi  is  the  ith  block  in  the  block-diagonal  matrix 
F.  This  paper  considers  the  stationary  case,  namely  that  the  process 
r(t)  has  reached  its  steady  state.  It  is  also  assumed  that  z(t),  z(t) 
and  r(()  are  perfectly  observed. 

The  behavior  of  the  overall  system  depends  on  the  relative  size 
of  li  and  (  as  they  both  are  sulficiently  small.  There  are  three  cases 
to  consider  :  (i)  /i  =  o(c),  (ii)  fi  =  0(c),  (iii)  c  =  o(/i).  Furthermore, 
due  to  the  possibility  of  transforming  the  system  with  Eq.  (l)-(2) 
into  a  decoupled  system,  we  shall  focus  on  the  decoupled  case  (i.e., 
A|}(r(()]  =  0,  and  i4]i[r(()]  =  0),  and  then  extend  the  results  to  the 
coupled  case. 

The  following  is  the  outline  of  the  paper.  Section  II  presents  the 
basic  mathematical  tools  for  the  behavior  of  a  near  decomposable 
FSMC  and  the  properties  of  additive  processes.  Section  III  studies 
the  limiting  behavior  of  the  decoupled  slow  mode  subsystem.  An 
approximate  model  for  the  slow  mode  subsystem  over  the  interval 
of  fast  transitions  among  the  fast  states  of  each  group  is  derived  in 
Section  3.1  based  on  the  aggregation  method.  Section  3.2  quantifies 
the  mean-squared  error  between  the  approximate  model  and  the  ac¬ 
tual  system.  The  asymptotic  stability  of  the  decoupled  slow  mode 
subsystem  is  explored  in  Section  3.3.  In  Section  IV,  the  limiting  be¬ 
havior  of  the  decoupled  fast  mode  subsystem  is  investigated  when 
both  fi  and  c  tend  to  zero.  Section  V  examines  the  coupled  case. 
Section  VI  considers  an  example  to  illustrate  the  methods  used  in 
Section  V.  Section  VII  concludes  the  paper. 

II.  MATHEMATICAL  PRELIMINARIES 

This  section  surveys  some  notations  and  results  concerning  the 
asymptotic  behavior  of  near  decomposable  FSMCs,  and  the  prop¬ 
erties  of  additive  processes.  These  results  play  important  roles  in 
studying  the  asymptotic  behavior  of  the  trajectory  of  the  system 
given  by  Eq.  (l)  (3). 

2.1  Near  Decomposable  FSMCs 

In  the  following  the  asymptotic  behavior  of  a  near  decomposable 
FSMC  is  summarized.  There  have  been  a  number  of  studies  in  the 


literature  concerned  with  the  asymptotic  approximation  and  aggre¬ 
gation  of  a  singularly  perturbed  FSMC  (7-8).  The  fast  transient  of  . 
the  process  r(()  is  formed  of  separate  transients  within  the  strongly 
coupled  groups.  Over  a  longer  period,  eau;h  group  of  the  strongly 
coupled  states  can  be  treated  as  an  aggregate  state. 

Let  the  switching  process  r(£)  satisfy  the  MSST  {multiple  semista-  ; 
bility)  conditiou(6|.  Thus,  two  time  scales  is  suflicient  to  describe  the  i 
glol>.al  evolution  of  the  process  r(t).  To  analyze  the  process  r(J),  one  ■ 
needs  the  following  notations  (6,10)  :  I 

Let  n  =  lim,_oo  ezp(^’’)-  Then 

n  =  diujirii.- -.nni 

where  Hi  =  ‘  =  I.— i'll  is  an  n«-dimensional  row  vector 

and  Ei  =  [<,>],  j  €  Si,  and  1,  is  an  n,-  dimensional  column  vector 
with  the  same  elements  1.  Furthermore,  define  an  m  x  n  matrix  V 
and  an  n  X  m  matrix  U  as  follows:  j 

V  =  =  diag[Eu---,En]-  (4)1 

then  VU  —  Y[,  UV  =  /,  where  I  is  an  n  x  n  identity  matrix.  ! 

For  fast  transient  analysis  of  r(£),  Eq.  (3)  can  be  expressed  in  ; 
the  stretched  time  scale  r  =  to  obtain 

P{r)  =  (5)  : 

For  aggregate  analysis  of  the  process,  r(£)  can  be  replaced  by 
an  aggregate  process  f(£)  taking  values  in  5  =  {l,...,n}.  Let  P{t) 
be  the  transition  probability  matrix  of  the  process  f(£).  Then  the 
evolution  of  the  aggregate  process  f(£)  is  governed  by  the  following 
equation, 

ht)  =  PilWGVJort>lo,  (6) 

and  UGV  is  the  infinitesimal  generator  of  the  process  .'(£)  .  Further, 
the  relation  between  P(t)  and  P(t)  is  given  b-  j 

P(l)^VP(t)U,/ort>lo.  (7)' 

Note  that  the  process  f(t)  is  stocha'tltally  continuous  and  has  a 
single  ergodic  class  such  that 

^IJin  P(<)  =  1  (8)^ 

where  Cj  is  the  ergodic  probability  of  the  aggregate  process  f(t)  cor- 1 
responding  to  the  state  i.  j 

In  this  paper,  the  steady-state  distribution  of  the  process  r(l)  will  : 
be  needed  ,  i.e.,  j 

Prob.lr(t)  =  Si]  =  e,e,y. 


2-2  Additive  Procesaea 

I 

I 

In  what  follows  additive  processes  on  each  group  of  the  process  j 
r(r)  in  the  stretched  time  scale  r  are  introduced  llO).  Earlier  ma- 
jor  w<  ,ks  on  additive  processes  appeared  in  (11,12).  Let  r(r)  €  Si. 
Define  the  function  aJ4,(r(r)]  as  the  (k4)th  element  of  the  matrix  j 
'4i{'(r)  €  5ij.  The  current  value  of  is  denoted  by  of  „ 

when  r(r)  ~  j  €  Si.  An  additive  process  is  defined  as  follows:  '  ■ 


(9)1 


For  a  sulliricntly  large  r,  one  would  expect  the  process  to  be 

a,symptotically  norma]  in  distribution  and  to  satisfy  a  central  limit  1 
theorem:  r 


Lemma  1  Let  Vijr)  =  /l,(r(s)  €  5,]ds,  and  y,,i,u(r)  denote 

the  (k,l]th  element  of  the  matrix  y't.i(T).  Then  all  elementt  V'i.i,H(r) 
are  mutually  independent,  asymptotically  normal  in  distribution  for 
a  sufficiently  large  r  and  satisfy  central  limit  theorems: 


/V(0,^), 


as  T  i.s  sufficiently  large, 


(10) 


The  values  of  crj^i,  i  =  l,...,n,  are  positive  (10).  For  proof  of  the 
above  lemma  one  sees  (10),  (11),  [12,  pp.  118-121)  . 

The  central  limit  theorems  are  useful  in  that  we  exploit  them  to 
derive  the  accuracy  of  the  approximate  model  for  the  decoupled  slow 
mode  subsystem. 

III.  DECOUPLED  SLOW  MODE  SUBSYSTEM 

This  section  considers  the  asymptotic  behavior  of  the  decoupled 
slow  mode  subsystem  given  by 

i(t)  =  /li[r(£))i(£),  i(fo)  =  xo-  (11) 

i  An  approximate  model  for  the  subsystem  over  the  interval  of  fast 
.  transitions  within  each  group  is  derived  based  on  the  aggregation 
method  .  Its  accuracy,  mean-squared  error,  is  quantified  by  using  of 
'  the  Magnus  Expansion  from  Lie  algebra  which  is  known  as  the  con¬ 
tinuous  analogue  of  the  Baker-Campbell-Hausdorff  (BCII)  formula. 
Finally,  the  stability  of  the  subsystem  is  discussed  and  two  stability 
criteria  ue  introduced. 

3.1  An  Approximate  Model 

In  general,  the  state  variables  i(t)  are  not  Markovian.  However, 
the  joint  process  {x{t)^,  r{t)y  is  a  Markov  process  whose  state  space 
is  IZ’’  X  S.  Here  we  formulate  the  joint  probability  density  function 
j  (p.d.f)  of  (x(ty,  r(i)y  which  is  denoted  by  Pij{t), 

j  <  i(£)  <  i-bdi,r(£)  =  j  e  5,}.  (12) 

Recall  that  the  process  r(t)  remains  in  the  ergodic  distribution  for 
i  aU  t  >  to.  Let 

I  Pl>(®>  0  “  0>  (19) 

I  SO  that  by  our  assumption 

I  =  1,  t  €  S,  j  6  Si.  (14) 

Then  define  ^(*,t)  =  (^,(i,t),...,p' Jz,!))’’.  Obviously,  the 
evolution  of  p‘{x,t)  is  governed  by  a  forward  Kolmogorov’s  equa¬ 
tion  (master  equation)  [13  ch.  3,  14,  15).  To  simplify  notation  we 
introduce  the  matrix  operator 

£*  =  diagm^},  (15) 

where  each  diagonal  entry  is  described  by 

A>Pi>(®>0  ~  (16) 

where  (.)*  denotes  the  kth  component  of  the  vector  (.).  Hence,  the 
forward  Kolmogorov’s  equation  is  given  by 

=  £*?(z,  f)  +  (IfT- + 

yi.Xo,  to)  given.  (17) 

Because  the  exact  solution  of  Eq.  (17)  is  difficult  to  find,  the 
singular  perturbation  method  is  applied  to  derive  asymptotic  repre¬ 
sentations  of  Eq.  (17).  This  approach  does  not  require  the  explicit 
solution  of  Eq.  (17),  but  rather,  leads  directly  to  asymptotic  ex¬ 
pansions  of  ?(z,£).  The  basic  idea  is  to  let  p'{x,t)  have  the  outer 
expansion 


^*(*.0=  X]‘V(i,0  (18) 

kssO 

I  where  p*(z,t)  =  (eieiip},(r, <) . e„e„mp^„{x,t))‘^. 

Substituting  this  in  Eq.  (17)  and  equating  the  coefficients  of  like 
'  powers  of  e,  we  obtain 

Cl(|)  :  /’V(*,0  =  0.  Aio,to)  =  fixeto)-  (19) 

0(1):  F^p'{x,t)^^^^-C’Az,t) 

-G^p”(z,(),  p'{io,to)  =  0. 


irhf^re 

Ku  =  E(a;  jr(r)]l  r(r)  6  5.)  = 


(20) 


0(.*) :  FV+'(i,i)  =  -  CVCi.O 

-GV(i.O.  p''(io.«o)  =  0,fc  =  1,2,...  (21) 

The  solutions  for  Eq.  (19)  is 

=  Piix.t),  i  =  1 . n.  j  e  Si.  (22) 

The  solvability  condition  [16,  ch.  15]  for  Eq.  (20)  is  given  by 

-  C'fixj)  -  G'^f(x,t))  =  0,  (23) 

where 

Fp'  =  0,  i  =  1, ...,  n.  ; 

Dehne 

C'pi(xJ)  =  -  ^  ^1((  Y,  *'i 
t=i  j=i+-V, 

With  the  notations  £*,  U  and  V,  Eq.  (23)  can  be  rewritten  in  a, 
matrix  form  i 

§iP{x,l)  =  C'p{x,t)  +  {UGVfp(x,t)  (24)' 

where  p(x,t)  =  (eipi(x,t),...,e„p„(x,t))'^.  I 

Equation  (24)  can  be  interpreted  as  follows  :  p(x,l)  is  the  joint 
p.d.f.  of  the  approximate  stochastic  hybrid  system  ^ 

x(0  =  -4i|?(01*(0.  *(<o)  =  xq.  (25) 

I 

where  ',1,  =  Cjy  */l,y  and  f(t)  is  an  aggregate  FSMC  with: 

the  generator  UGV  ,  and  f(l)  remains  in  the  ergodic  distribution  for 
all  (  >  (q.  In  other  words,  in  the  time  scale  t,  as  c  tends  to  zero, 
each  group  of  the  strongly  coupled  states  is  aggregated  to  a  single 
state  and  the  asymptotic  behavior  of  the  state  variables  within  each' 
group  is  approximaterl  by  a  deterministic  trajectory  of  a  linear  time 
invariant  system. 

To  have  higher  order  approximations,  we  need  to  solve  for  the 
solvability  condition  of  Eq.  (21) 

(p:f(  -  c*p‘(i.  <)  -  gV(x.  0)  =  0.  (26) 

where  i  =  1,  ...,  n,  and  it  =  1,  2,  .... 

3.2  Mean-Squared  Error 

In  this  subsection  the  mean-squared  error  between  the  actual 
states  and  the  approximate  states  is  derived.  Let  z,(l)  denote  the 
state  vector  of  the  subsystem  when  r(l)  takes  values  in  5;.  The 
approximate  model  has  deterministic  trajectories  i,(t)  while  r(t)  so¬ 
journs^  in  the  group  5,-.  The  method  used  to  find  the  mean-squared 
error  is  based  on  the  Magnus  Expansion  from  Lie  algebra  known 
as  the  continuous  analogue  of  the  Oaker-Campbell-Hausdorff  (BCH) 
formula  (17-19). 

We  next  show  how  the  Magnus  Expansion  and  Lemma  1  can  be 
used  to  prove  that  i,(<)  can  be  approximated  by  i,(l)  over  an  interval 
[to,  'f’ll,  where  Ti  -  to  €  (0,c|,c  is  finite. 

Assume  that  r(t)  g  5,  at  the  time  interval  [tcTi].  I^t  0  <  «  <<  1 
and  0  <  t  < <  i  <<  Ti  -  to.  With  the  Magnus  Expansion,  the  state 
transition  matrix  of  the  slow  mode  system  over  the  interval  (to,  to  +  <l 
is  given  by 


where  the  symbol  [., .]  is  the  commutator  product  or  the  Lie  product. 

To  find  the  limiting  behavior  of  the  state  transition  matrix  $(10, 
to  -T  i)  as  e  tends  to  zero,  we  treat  the  terms  in  the  exponent  sepa¬ 
rately.  Then,  combining  all  these  together  yields 

*(to,to  +  S)  =  eip{'/i,«  -I-  A.)  -I-  0(c’)}  (28) 

From  Lemma  1  it  follows  that  all  elements  of  are  mutually  in¬ 

dependent,  asymptotically  normal  in  distribution  with  variance  0(e) 
as  <  tends  to  zero.  Therefore,  it  is  easy  to  verify 

l|mE{||xi(l)-ii(t)||*}  =  0,  t  6[to,<o  +  «l.  (29) 

To  finish  the  approximation  over  the  interval[ta,  Ti]  where  T,  -  lo 
is  the  sojourn  time  of  r(f)  in  the  group  5,-.  we  choose  a  sufficiently 
positive  small  number  S  and  a  positive  integer  K  such  that  S'  <  S 
and  Ti  —  to  =  KS  .  The  limiting  behavior  of  the  state  transition 
matrix  i(to,Ti)  is  computed  by 


lim  $(fo,7i)  =  lim  *('0  +  KS'jo  +  (K  -  l)i') 

...*{to,to  +  S')  =  =  e'Mr.-t,)  (30J 

in  the  mean-squared  sense.  This  implies  that 

limE{||z,(t)-i,(t)||*}  =  0,  t  e[<o,r,].  (31) 

Note  that  if  T)  —  fo  =  oo,  the  mean-squared  error  at  time  Tj  may  be 
unbounded  even  if  t  tends  to  zero.  Ilowever,  it  is  well  known  [20] 
that  the  irreducible  FSMC,  r(t),  has  a  finite  return  time  for  each 
state,  i.e.,  Ti  —  to  is  bounded  with  probability  one.  The  following 
summarizes  the  result. 

Theorem  1  Suppose  that  all  values  o/ Mjy,  j  €  Si,  are  bounded. 
Then  the  solution  z,(t)  to  the  problem  Eq.  (11)  converges  in  the 
mean-squared  sense  to  the  solution  z,(l)  of  the  approximate  model 
Eq.  (is)  as  i  tends  to  zero,  i.e., 

I  lim  E{||zi(f)  -  z,(f)||*}  =  0,  zi(fo)  =  *i(fo) 

for  t  €  vohere  Ti  —  to  is  the  sojourn  time  of  the  r(t)  process  in 

I  the  group  Si. 


■  Note  that  Theorem  1  does  not  show  that  the  approximate  model  is 
I  valid  over  the  entire  interval  [(q,  oo). 


'  3.3  Stability 


In  the  subsection  the  stability  of  the  decoupled  slow  mode  sub- 
'  system  is  studied.  The  stability  criteria  are  based  on  the  logarithmic 
I  norm  (21).  In  order  to  derive  the  stability  criteria,  a  brief  introduc- 
I  tion  to  the  notation  of  logarithmic  norm  is  given  as  follows; 

'  Definition:  The  logarithmic  norm  associated  unth  the  induced  ma¬ 
trix  norm  ||  •  ||  is  defined  by 


I 

)i(>l)  =  lim 

H-,0* 


h 


(32) 


Two  stability  criteria  are  introduced  to  the  decoupled  slow  mode 
subsystem. 

Theorem  2  The  zero  solution  of  the  slow  mode  hybrid  system  to  be 
almost  sure  exponentially  stable,  it  is  sufficient  to  have 


rio+» 

^(<0.  fn  +  i)  =  exp{  j  /l(r(s))ds 
+  2  /l(r(.,'))d.,qds 

+  7  /  (/Urf.s)),  /  [A{r{s'))ds’ .  /  A[r{s")]ds’']ds']ds 

/fo 


n  ^i+«s 

He.-  H  e,j/i(  '/!,>)<  0, 

•=i  A/;+i 

and  necessary  to  have 
n  M+ni 

Y  >  0. 

ml  A/i-fl 


(33) 


(34) 


IV.  DT-COUPLED  FAST  MODE  SUBSYSTEM 


^(Z(0),0)  =  p(Z(0),0). 


(45) 


This  section  e.xnmines  the  limiting  behavior  of  the  decoupled  fast 
mode  subsystem  given  by 

ttiU)  =  /I,[r(f)lr(f),  zUo)  =  r„.  (35) 

Assume  that  all  v-<!iips  of  are  hounded,  i  €  .V,  J  €  5,  and 
r(f)  =  J  €  5,  .  To  analyze  the  state  r((),  the  stretched  time  sc.ale 
6=  (t  —  fo)//*  is  used.  Hence,  expressing  the  system  given  by  Eq. 
(35)  in  the  stretched  time-scale  S  yields 

Z{9)  =  Aj[r{9)]Z{e),  2(0)  =  zo-  (36) 

Similarly,  the  equation  for  the  evolution  of  r(9)  becomes 

dP(9)  u 

=  P(9}(  ttF+  ;r(7),  P(Q)  =  /.  (37) 

Similar  to  Section  3. 1 ,  it  is  known  that  the  joint  process  (Z(9)^ ,  ^(9)) 
is  a  Markov  process  whose  state  space  is  R'  x  5.  Then  we  denote  by 

P,AZ.e). 

p,,(Z,9)  =  Prob.{Z  <  Z(9)  <  Z  +  dz.r{9)  =  >  6  5.},  (38) 

the  joint  p.d.f.  of  the  process  (Z{9)^,  r(9))^.  Defining  p(Z.  9)  = 
(Pii(^.  ^).  •1  PnmiZ.  9))^,  then  the  ;>( Z,  9)  is  governed  by  the  for¬ 

ward  Kolmogorov's  equation  shown  below, 

=  rp{Z,9)  +  {ftF^  +  /.g^)p(Z.9). 
p{Z{Q),0)  given.  (39) 


From  Eq.  (45)  it  follows  that  r(9)  takes  values  among  the  fast  states 
of  the  group  Si,  then  Zi(9)  behaves  approximately  as  any  time  invari¬ 
ant  systems  with  constant  system  matrix  held  to  their  values  at  the 
last  transition  of  r(9).  The  following  theorem  addresses  the  problem 
for  the  case. 

Theorem  4  Assume  that  n  =  0(()  and  Um,_o/i/c  — >  k  where  k  = 
0(1).  Then  Z,{9)  is  approximately  modeled  as  a  hybrid  system  Eq. 
(36)  depending  only  on  the  fast  states  of  the  ith  group  with  generator 
Fik  in  the  9  time  scale. 

Case  III.  f  =  o(/()  and  lim,_.o/'/f  -*  co¬ 
in  this  case  we  define  the  symbol  -  =  p'  where  p  tends  to  zero 
as  t  tends  to  zero.  Let  p(Z,9)  have  the  outer  expansion  given  by  Eq. 
(40).  Repeating  the  same  procedure  discussed  before  yields  the  same 
results  in  Section  III. 

Theorem  5  Suppose  that  t  =  o(p)  and  lim,_o /i/e  — ►  exs  and  r(to)  € 
Si  and  all  values  of  ^Ai  are  finite.  Then,  for  t  6  [toiTJ 

1.  2i(0  can  he  approximated  as  an  average  system  whose  dynam¬ 
ical  equations  are  described  by 

pzi{t)  =  *i4iii(t),  ii(to)  =  ^(fo)-  (46) 

S.  The  mean-squared  error  between  z,(t)  and  ii(l)  tends  to  zero 
as  €  tends  to  zero. 


In  the  sequel  singular  perturbation  methods  are  used  to  derive  the 
asymptotic  expansion  of  p{Z.9).  The  limiting  behavior  of  the  solu¬ 
tion  of  Eq.  (39)  depends  on  the  relative  size  of  p  and  e  as  both  they 
tend  to  zero.  There  are  three  cases  to  be  consiclered; 

Case  p  =  0(c)  and  lim(_o  f  —•  0. 

In  this  case  we  introduce  the  symbol  p'  =  S.  p'  tends  to  zero  as 
f  tends  to  zero.  p{Z,9)  has  the  outer  expansion 

p(z,9)  =  f;f;(/i')V'p‘*'(z,(?)  (40) 

*=o  *'=0 

where  p**  (Z,9)  =  [eieuptf  (Z,9),...,e„e„„pJ|^(Z,9)p. 

Substituting  this  in  Eq.  (39)  and  equating  the  coefficients  of  like 
powers  of  <  and  solving  for  the  leading  order  term  yields 


V.  THE  COUPLED  CASE 

This  section  considers  the  singularly  perturbed  stochastic  hybrid 
system  with  Eq.  (1)  -(3).  The  purpose  of  this  section  is  to  define 
a  slow  mode  subsystem  that  describes  the  slow  dynamics  and  a  fast 
subsystem  that  describes  the  fast  dynamics.  Their  solutions  are  then 
used  to  approximate  x(t)  and  z{t)  over  the  interval  of  fast  transitions 
within  each  group.  The  decomposition  of  the  system  and  the  r(t) 
process  together  into  slow  and  fast  subsystems  depends  on  the  rela¬ 
tive  size  of  p  and  c.  First,  consider  the  case  p  =  o(f)  and  p  =  0((). 

The  slow  mode  subsystem  over  the  interval  of  fast  transitions  within 
each  group  is  approximated  by  a  linear  time-invariant  system  model. 
The  fast  subsystem  must  be  redefined  over  the  interval  of  each  fast 
transition  of  the  r(<)  process. 

Secondly,  the  case  0<€<p<lis  considered.  In  this  case 
the  system  can  be  regarded  as  a  slow  mode  subsystem  with  system 
matrix 


f^{Z,9)  =  eiei,S(Z-  Z(9)),  (41) 

where  Z{9)  =  e  ’'^•>*Z(0).  Equation  (41)  can  be  interpreted  as  fol¬ 
low;  if  r(0)  =  y  €  5,-  and  Z(0)  =  r(<o).  4he  limiting  behavior  of  Z(9) 
is 

Z(9)  =  e  ’^-’^(<0).  (42) 


>«(r(«)]  = 


^i[r(<)] 


4ri  (’•(■)] 


^I7(r(01 

4rir(l)] 


(47) 


with  the  initial  conditions  Xi(to)  =  xo  and  Zi(<o)  =  tq.  Since  all  the 
subraatrices  are  bounded,  the  solution  to  the  system  converges  in  the 
mean  square  sense  to  an  approximate  model  with  averaged  system 
matrix 


which  when  transformed  to  the  time  scale  t  becomes 

z(t)  =  e  ’''•'<'-'<>>/‘'r(<o),  for  t  >  tg.  (4.3) 


(48) 


If  all  the  values  of  ’’Aij  are  stable,  then  r(<)  can  be  approximated 
by  Eq.  (43).  The  following  theorem  summarizes  the  case. 

Theorem  3  Assume  that  p  =  o(()  and  tim,_o/*/<  0>  oil  the 

values  of  ^Aij  are  stable,  and  r(to)  =  /  6  Si.  Then  as  c  — »  0 

z(t)  =  exp{  ^Aij{t  -  ta)/p)z(to),  for  t  >  to-  (44) 

Case  II.  p  =  0(e)  and  lim,_o/r/<  -•  k  where  k  =  0(1). 

The  case  introduces  the  symbol  ?  “  ^  4"  ®(f*  )  where  p  tends 
to  zero  as  e  tends  to  zero.  Like  the  first  case,  p(Z,9)  has  the  outer 
expan.sion  given  by  Eq.  (40).  Substituting  this  in  Eq.  (39)  and 
equating  the  coefficients  of  like  powers  of  e  and  solving  for  the  leading 
order  term  yields 

/lyi^fZ.g)  ^  c'p(Z,9)^(kF'^)f^(Z,9), 


when  r(t)  takes  values  in  Si  for  to  <  f  <  Ti. 

To  analyze  the  behavior  of  the  system,  it  is  assumed  that  all 
the  values  of  system  matrices  ’A;  are  invertible  and  stable.  Thus,  a 
desired  slow  dynamics  is  given  by 

i.(t)  =  (’A.--*’A.(M,)-'(»’A.))i.(t), 

=  '  Aii,(f),  ri(<o)  =  ^0-  (49) 

and  a  desired  fast  dynamics  given  by 

p'zf_i  =  *A,r/,,(t),  (50) 

with  the  initial  condition  zf.i(tg)  =  zg  -  AiXg.  Finally,  the 

solutions  of  Eq.  (49)  and  Eq.  (50)  are  used  to  approximate  the 
original  slow  and  fast  states. 


Tlieorein  6  /l.<srimf  that  0  <  ^  /i  1,  r(t)  takes  values  in  Si, 
for  I  g  [/o,3'il,  and  all  the  values  of  system  matrices  ’/i,-  are  stable 
and  invertible.  Then 

1.(0  =  i,(0 +  0(/i).  (51) 

^.(0  =  -V..(0-(’.i.)-'{  ’M.)i,(()  +  0(/i)  (52) 

where  ii(l)  and  sj,i{t)  are  respective  stales  of  the  slow  model  Eq. 
(fO)  and  the  fast  model  Eq.  (50). 

VI.  AN  EXAMPLE 


An  example  to  illu.strate  the  method  in  Section  V  is  demonstrated 
here.  Tlie  systems  are  given  by  Eq.  (l)-(3)  where  r(f)  g  R' ,  z(t)  g 
/J*.  and  the  switching  process  r(i),  shown  in  Fig.l,  consists  of  two 
groups.  Eacli  group  contains  two  strongly  interacting  states.  Its 
generators  are  given  by 
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The  corresponding  system  matrices  given  by  Eq.  (47)  of  the  states 
for  the  two  groups  are 


-4,, 


-3  2 

3  -4 


An 


3  2 
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respectively.  Thus,  aggregate  models  obtained  from  Eq.  (48)  for 
both  groups  are 


-2.4  1.4 

2.4  -3.4 


,  A7  = 


2.4  1.4 

2.4  -3.4 


The  example  considers  two  cases:  (i)  ft  =  0.01,  <  =  0.001,  (ii) 
/r  =  0.01,  e  =  0.0001.  Given  the  initial  conditions  *i(0)  =  [2.0,3.0p 
and  [lo.'hl  =  [0,  0.3],  the  trajectories  of  Xi(t)  and  i,(<),  ^i(0 
Zi{t)  within  the  group  5,-,  i  =  1,2,  are  shown  in  Fig.  2  to  Fig.  5  for 
the  two  cases.  The  simulation  results  illustrate  that  the  approximate 
models  are  valid  when  0  <  «  •<  /i  <C  1. 

VII.  CONCLUSIONS 

This  paper  considered  the  asymptotic  trajectory  of  a  singularly 
perturbed  hybrid  system  whose  state  equations  depend  on  a  near- 
decomposable  finite  state  Markov  chain.  The  Umiting  behavior  of  the 
decoupled  slow  mode  subsystem  over  the  interval  of  fast  transitions 
within  a  group  can  be  approximated  by  an  averaged  value  of  the 
system  matrix  over  all  their  values  depending  on  the  fast  states  of 
the  group.  The  mean-squared  error  between  the  approximate  model 
and  the  original  one  tends  to  zero  when  t  tends  to  zero.  The  stability 
of  the  decoupled  slow  mode  subsystem  is  discussed  and  two  criteria 
are  introduced.  The  limiting  behavior  of  the  fast  mode  subsystem 
over  the  interval  of  fast  transitions  within  each  group  depends  on 
the  relative  size  of  /i  and  f  when  both  fi  and  €  tend  to  zero.  The 
subsystem  can  be  approximated  as  a  time  invariant  system  with  the 
constant  system  matri.x  held  to  the  value  at  the  initial  transition 
when  /j  =  o(f).  The  subsystem  can  be  modeled  as  a  hybrid  system 
depending  only  on  the  fast  states  of  each  group  when  =  0(e).  The 
subsystem  can  be  approximated  as  an  averaged  value  of  the  system 
matrix  over  all  their  values  depending  on  the  ftist  states  of  each  group 
when  f  -  niji).  In  the  coupled  case  approximate  models  for  reduced 
order  systems  are  investigated  according  to  the  relative  size  of  the 
two  perturbation  parameters.  The  results  are  shown  to  hold  when 
the  switching  process  is  stationary  and  irreducible,  each  group  of 
strongly  interacting  states  is  irreducible  and  time  reversible. 

Adflitional  work  remains  to  be  done  in  the  analysis  of  the  lim¬ 
iting  behavior  of  singularly  perturbed  hybrid  systems  with  control, 
or  with  noise,  or  both  by  using  the  preceding  methodology.  This 
approach  permits  a  unified  treatment  of  approximate  models.  The 


results  presented  in  this  paper  provide  an  initial  step  in  facilitating 
the  analysis  of  the  behavior  of  singularly  perturbed  hybrid  systems 
with  control  or  noise. 
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ABSTRACT 

This  paper  studies  the  stabilization  of  a  stochastic  hybrid  system  whose  state 
equations  are  governed  by  a  stochcistic  switching  process,  which  is  modeled  as  a  con¬ 
tinuous  time  finite  state  Markov  chain  (FSMC).  First,  Linear  feedback  laws  with 
non-switching  gains  are  proposed.  The  non-switching  gains  are  computed  based  on 
the  suflScient  conditions  derived  for  the  definition  of  non-switching  stochastic  stabi- 
lizability.  Secondly,  Linear  feedback  laws  with  imperfect  detectors  are  studied.  The 
range  of  the  detection  probability  for  the  detectors  are  computed.  The  results  are 
shown  to  hold  when  the  Markov  chain  is  irreducible  and  the  system  states  axe  per¬ 
fectly  observed. 

I.  INTRODUCTION  AND  PROBLEM  FORMULATION 
1.1  Introduction 

The  present  paper  is  concerned  with  the  stabilization  of  a  claiss  of  stochctstic  hy¬ 
brid  systems  .  The  state  space  of  a  stochastic  hybrid  system  is  a  cross  product  of 
an  Euclidean  space  and  a  finite  discrete  space.  Basically,  stochastic  hybrid  systems 
are  a  special  type  of  linear,  piecewise  constant,  time  varying  systems  which  switch 
randomly  among  a  finite  number  of  linear  time  invariant  models.  The  switching  be¬ 
haves  like  a  continuous  time  finite  state  Markov  chain  (FSMC).  Such  systems  have 
been  successfully  used  to  model  pilot  commands  in  target  tracking,  isolation  levels  of 
^This  research  was  supported  by  the  U.  S.  Air  Force  under  grant  AFOSR-89-0241 
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solar  receivers,  abrupt  variation  in  the  parameters  of  economic  systems,  and  systems 
subject  to  sudden  component/sensor  failures  or  repairs,  abrupt  environmental  distur¬ 
bances  and  changing  subsystem  interconnections  [1].  Preliminary  work  established 
the  optimal  control  solutions  for  stochastic  hybrid  systems  [2-7].  For  a  quadratic  per¬ 
formance,  the  optimal  linear  feedback  control  law  with  switching  gains  has  already 
been  proposed.  On  the  other  hand,  several  schemes  to  stabilize  stochaistic  hybrid 
systems  are  investigated  in  [Ij.  Another  study  [7]  developed  the  new  definition  of 
stochcistic  stabilizability,  and  then  established  sufficient  and  necessary  conditions  for 
this  definition.  Indeed,  most  of  previous  techniques  require  both  the  continuous  states 
and  the  value  of  the  Markov  chain  to  be  measured,  in  order  to  implement  on-line 
the  feedback  laws  with  switching  gains.  In  practice  some  information,  such  as  the 
complete  knowledge  about  the  Markov  chain,  is  often  difficult,  if  not  impossible,  to 
obtain.  Some  control  strategies  with  less  knowledge  of  the  Markov  chain  would  be 
more  realistic  [1,8]. 

Our  aims  here  are  to  develop  new  stabilization  schemes  for  a  class  of  stochastic 
hybrid  systems.  The  schemes  require  less  knowledge  of  the  Markov  chain.  These 
techniques  are  expected  to  aid  in  the  design  of  controllers. 

1.2  Problem  Formulation 

The  system  models  under  consideration  are  assumed  to  have  the  following  state 
equations: 

ir(0  =  A[r(f)]x(0  -I-  5[r(t)]u(<),  (1) 

where  t  G  [^oiT],  T  may  be  finite  or  infinite,  x{t)  G  i?*  represents  the  system  states 
and  u{t)  G  BT'  the  control.  All  matrices  are  of  proper  dimensions  and  are  random 
through  their  dependence  on  the  values  of  the  random  process  r(i),  called  “form 
index”.  The  form  index  r{t)  is  governed  by  a  continuous-time  FSMC  taking  values 
in  a  finite  set  S  =  {1, 2, ....,  TV}.  The  evolution  of  the  form  index  r{i)  with  time  is 
described  by  the  state  transition  probabilities  of  associated  FSMC  on  S 
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(2) 


Prob.{r{t  +  A)  =  j|r(<) 


A,j  A  +  o(A) 

1  -  A.  A  +  o(A) 


if  i  ^  j 
if  i  =  j 


N 

A.  =  E 


where  A  >  0,  and  all  values  of  A’s  are  finite.  Let  A  be  the  generator  of  the  r(t)  process. 
Assume  that  the  initial  values  io»  and  tq  are  independent  random  variables;  xq  and 
2o  are  also  independent  of  the  <T-algebra  generated  by  {r(t),t  €  (to,?’]}.  The  current 
value  of  the  system  model  is  denoted  by  an  index,  for  example,  A,-  and  Bi  will  denote 
A[r(t)]  and  5[r(t)]  when  r{t)  =  i.  The  paper  assumes  that  the  x{t)  is  perfectly 
observed  and  the  r(t)  process  is  irreducible.  The  ergodic  distribution  of  the  r(t) 
process  is  given  by 


lim  Prob.{r{t)  =  j|r(0)  =  i}  =  ej,  i,  j  ^  S 

t— *00 

The  paper  is  organized  as  follows.  Section  II  develops  linear  feedback  laws  with 
non-switching  gains  when  the  controllers  are  allowed  to  feedback  only  the  continuous 
states.  Linear  feedback  laws  with  switching  gains  are  considered  in  Section  III  when 
practical  detectors  are  used  to  observe  the  value  of  the  Meurkov  chain.  Three  illustrate 
examples  are  given  in  Section  VI.  Section  V  concludes  the  paper. 


II.  Stabilization  Via  Non-switching  Gains 


This  section  considers  the  scheme  to  stabilize  the  system  given  by  Eq.  (l()-(2) 
without  ciny  knowledge  about  the  r{t)  process.  In  [8]  the  author  heis  showed  that  non¬ 
switching  control  gains  for  a  class  of  stochastic  hybrid  systems  may  be  preferable,  in 
addition  to  the  fact  that  they  are  much  eaisier  to  implement.  In  what  follow  some 
notations  and  the  definition  of  non-switching  stochastic  stabilizability  are  introduced. 
Sufficient  conditions  for  the  new  definition  are  derived.  The  non-switching  gains  are 
computed  based  on  the  sufficient  conditions. 

Let  x{t,  xq,  u)  denote  the  trajectories  of  the  random  processes  x(t)  from  the  initial 
states  x(0)  =  xo,  under  the  action  of  the  admissible  control  u(f)  and  every  sample 
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path  of  r{t).  A  new  definition  of  non-switching  stochastic  stabilizability  ,  modified 
from  the  definition  in  [7],  is  described  below. 

Definition  l-System  (l)-(2)  is  said  to  be  non-switching  stochastically  stabilizable 
if,  for  any  finite  iq  €  7?",  there  exists  a  linear  feedback  control  law  L  that  is  constant 
for  all  values  of  r(t)  €  S; 

u{t)  =  —Lx{t) 

such  that  there  exits  a  symmetric  positive  definite  matrix  M  satisfying 
T 

lim  E{  I  X  (t,xo,u)x(t,xo,u)dt\  xq}  <  XqMxq 

T—oo  JQ 

where  ||Z,||  <  oo,  and  x'{Al')  denotes  the  the  transpose  of  the  vector  x{t)(  the  matrix 
M). 

From  the  above  definition,  non-switching  stochastic  stabilizability  of  a  system 
means  that  there  exists  a  linear  feedback  law  which  drives  the  x  states  from  any  finite 
initial  states  Xq  asymptotically  to  the  origin  in  the  mean  square  sense.  Sufficient 
conditions  for  non-switching  stochastic  stabilizability  are  derived  as  follows: 

Theorem  1  System  (l)-(2)  is  non-switching  stochastically  stabilizable  if ,  for,  for 
each  form  i  €  S,  there  exist  a  control  law  u{t)  =  —Lxft)  such  that  for  any  given 
positive  definite  symmetric  matrix  Ni,  the  (unique)  set  of  symmetric  solutions,  M,, 
of  the  N  coupled  matrix  equations 

1  1  ^ 

{Ai-BiL--\i)'Mi  +  Mi{Ai-BiL--Xi)-^  Y,  = -iV.-  (3) 

are  positive  definite  for  each  f  6  5. 

Note  that  Theorem  1  does  not  require  the  eissumption  of  the  irreducibility  of 
the  r(t)  process.  The  proof  of  the  theorem  is  similar  to  the  proof  in  [6]  except  the 
unobserved  value  of  the  r{t)  process.  In  applying  Theorem  1,  we  choose  the  control 
law  L,  let  Ni  be  identical  or  simple  diagonal  matrices,  and  then  solve  for  Eq.  (3)  to 
obtain  the  symmetrical  and  positive  definite  matrices  {M,  :  i  €  5}. 
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The  sufficient  conditions  in  Theorem  1  are  difficult  to  check.  There  exits  a  simple 
necessary  condition  for  non-switching  stochastic  stabilizability  of  system  (l)-(2).  If 
system  (l)-(2)  is  non-switching  stochastically  stabilizable,  then  in  each  form  i,  L  can 
be  chosen  such  that  all  the  closed-loop  system  matrices  (Ai  —  BiL  —  ^A,)  are  stable. 

The  non-switching  gains  can  be  computed  based  on  the  following  procedure.  First, 
let  the  control  law  L  be  chosen  such  that  all  the  matrices  {Ai  —  BiL  —  ^Xi  :  i  €  S'} 
are  stable.  Secondly,  let  {Ni  :  i  £  S]  be  identical  or  simple  diagonal  matrices,  and 
then  solve  for  Eq.  (3)  to  obtain  a  set  of  symmetrical  matrices  {Mi  :  i  6  S’}.  Finally, 
stop  the  procedure  if  all  the  matrices  M,  are  positive  definite.  If  not,  go  to  the  first 
step  and  repeat  the  procedure  again.  An  example  (Example  1)  to  illustrate  such 
procedure  is  shown  in  Section  IV. 

III.  STABILIZATION  VIA  SWITCHING  GAINS  WITH  IMPERFECT 

DETECTORS 

In  the  previous  section,  stabilization  of  stochastic  hybrid  systems  by  a  nonswitch¬ 
ing  linear  constant  feedback  law  was  introduced.  The  main  advcintage  of  this  stabi¬ 
lization  scheme  is  lack  of  the  need  for  detection  and  estimation  of  the  r(t)  process. 
However,  despite  the  simplicity  of  this  scheme  it  does  not  permit  a  large  class  of  such 
systems.  To  alleviate  the  shortcoming,  switching  gain  stabilization  is  considered  in 
this  section.  The  scheme  requires  a  form  index  detector  to  detect  the  current  value 
of  the  r{t)  process.  The  detector  is  with  the  following  characteristics: 

•  All  jump  times  of  the  r{t)  process  can  be  detected. 

•  Let  r*(f)  denote  the  output  of  the  detector.  The  value  of  the  r*{t)  over  the 

interval  where  tk  and  are  two  successive  jump  times  of  the  r{t) 

process,  remains  constant.  The  relationship  between  r{t)  and  r*(/)  at  the  jump 
time  tk  is  given  by 

Prob.{r’{tk)  =  j\r{tk)  =  i',  }  =  |  ^ 
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where  p  is  the  detection  probability  of  the  detector  for  each  form  and  q  = 

In  the  scheme  the  linear  feedback  control  law  depends  on  the  system  states  and 
the  value  of  the  detector,  i.e., 


u{t)  =  -L[r*(t)]x(0, 


.  where  ||Z<,||  <  oo.  Thus,  the  closed-loop  system  becomes 


x(0  =  ^’(01^(0  (4) 

where  /4[r(t),  r*(t)]  =  >l[r(t)]  —  5[r(t)]Z,[r*(t)]. 

In  what  follows  the  stability  of  the  system  given  by  Eq.  (4)- (2)  is  studied  and 
two  stability  criteria  are  introduced.  To  have  the  stability  criteria,  a  brief  review 
to  the  notation  of  logarithmic  norm  is  given.  The  logarithmic  norm  (also  called  the 
measure  of  matrix)  was  investigated  in  1958  separately  by  Dahlqnist  [9]  and  Lozinskij 
[10]  .  The  properties  of  the  norm  have  been  well  documented  in  [11].  The  norm  has 
been  applied  extensively  to  study  the  growth  of  the  solution  of  linear,  time  varying 
systems.  Below  is  the  definition  of  the  logarithrm’c  norm. 

Definition  2:  The  logarithmic  norm  associated  with  the  induced  matrix  norm  ]]  •  ][ 
is  defined  by 


u(A)  =  lim 
^  '  0—0 


\\I  +  0A\\-l 

e 


With  the  norm  p.  and  the  irreducibility  of  the  r[t)  process,  we  derive  the  following 
important  lemma  in  that  we  use  it  to  find  the  conditions  for  the  stability  criteria  of 
the  system  given  by  Eq.  (4)-(2). 


Lemma  1  Let  the  r{t)  process  be  irreducible.  Then 

jiirn  ^  I  ^(^[r(i),r*(0])d<  =  w.p.l 

where  Aij  denote  the  current  value  of  y4[r(f),r*(/)]  when  r(t)  =  i,  r'’(f)  =  j. 

Proof:  Let  Ti  denote  the  total  sojourn  time  over  the  interval  [0,  T]  for  each  form  i  of 
the  form  index  r(<).  Since  r{t)  is  an  irreducible  FSMC,  it  is  well  known  [12]  that 
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Ti 

Jim  =  e,-,  w.p.l 

T^oo  1 

Further,  Tij  is  defined  as  the  total  time  when  r{t)  =  i  and  r*(t)  =  j  over  interval 
[0,  T].  It  is  ea^y  to  show  that 

p{Aij)Tij  _  (  peip{Aii)  if  i  =  j,  w.p.l 
T  \  qeip{Aij)  if*#i,  w.p.l 

This  completes  the  proof. 


Theorem  2  The  null  solution  of  the  system  given  by  Eq.  (4)-(2)  is  almost  sure 
exponentially  stable  if  it  is  sufficient  to  have 

ei{pp{Aii)  +  q  Y  <  0 

i=i 

and  necessary  to  have 

Y  +  ^  Y  >  0 

t=I  j=lj¥« 

Proof:  From  the  theorem  in  [11,  pp.  89]  it  follows  that 


<  ||x(r)||  <  Me/.’' 


As  T  tends  to  infinity.  Lemma  1  yields  the  results. 

If  p=l,  i.e.,  the  detector  is  perfect,  the  results  are  shown  in  [Ij.  In  other  words,  if 
the  system  states  and  the  value  of  the  r[t)  process  «ire  perfectly  observed,  and  if  the 
linear  feedback  law  with  switching  gains  satisfies 

(5) 

1=1 

then  the  null  solution  of  the  closed-loop  system  is  almost  sure  exponentially  stable. 
There  arises  a  interesting  problem:  if  the  control  law  with  switching  gains  is  designed 
to  satisfy  the  condition  given  by  Eq.  (5),  what  is  the  the  range  of  p  such  that  the 
closed  system  is  almost  sure  exponentially  stable  when  the  imperfect  detector  is  used? 
The  following  corollaries  answers  the  problem. 
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Corollary  1  t/Eili  e,/i(/4„)  <  0  and  <0,  andO  <p  <  I, 

then  the  system  given  by  Eq.  (i)-(2)  is  almost  sure  exponentially  stable. 

Corollary  2  i/Eili  e,7x(i..)  <  0  and  >  0,  and 


p  >  1  + 


jvhj  e,  p(A,j)  -  ^iM(A„) 

then  the  system  given  by  Eq.  (4)-(2)  is  almost  sure  exponentially  stable. 


(6) 


IV.  EXAMPLES 


In  this  section  three  examples  are  provided  to  illustrate  the  methods  derived  in 
Section  II  and  III.  The  first  example  demonstrates  that  a  system  given  by  Eq.  (1)- 
(2)  can  be  stabilized  by  using  only  a  linear  feedback  law  with  nonswitching  gains, 
i.e.,  any  control  law  with  switching  gains  does  not  satisfy  the  sufficient  condition  (or 
the  necessary  condition)  given  by  Eq.  (5).  The  second  example  examines  a  system 
which  can  be  stabilized  via  both  two  methods  discussed  before.  The  range  of  the 
detection  probability  for  the  detector  is  computed.  Finally  a  system  which  can  not 
be  stabilized  by  the  method  in  Section  II  but  can  be  stabilized  by  the  method  of 
Section  III  is  considered. 

Example  1:  Consider  a  system  with  the  form  index  r{t)  taking  values  in  a  finite 
set  S  =  {1,2}  with  the  generator 


The  system  and  input  matrices  are  given  by 


A,  = 


0  1 
2  3 


= 


1 

0 


A? 


0  1 
4  5 


,  B2 


0 

1 


This  example  chooses  the  linear  feedback  laws  L  and  {Ni  ;  t  €  S}  cis  follows; 
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I  =  [5,  17],iVi  = 


1  0 
0  1 


TV,  = 


0  2 
0  2 


Solving  for  Eq.  (3)  yields 

2.2798  0.2418  ' 

0.2418  0.4156 

Since  the  solutions  M\  and  M2  are  symmetric  and  positive  definite,  the  system  is 
non-switching  stochastically  stabilizable.  However,  this  system  can  not  be  stabilized 
by  the  method  in  Section  III.  Given  the  logarithmic  norm  associated  with  the  induced 
matrix  norm  ||  •  ||i  (  or  ||  •  II2  or  ||  •  ||oo),  for  any  set  of  designed  switching  gains  Li  and 
L2,  it  is  shown  that 


Ml  = 


0.7634 

1.2792 


1.2792 

7.9064 


eifi(Ai  —  BiLi)  -f  e2fi{A2  —  B2L2)  >  0 


ei^(Ai  —  B1L2  +  G2fJ'{A2  ~  B2L1)  >  0 


The  sufficient  condition  given  in  Theorem  2  does  not  hold. 
Example  2:  Consider  a  system  with 


A  = 


-2  2 
3  -3  ’ 


0  1 
2  3 


Bi 


0  1 
4  5 


,B2 


1 

2 


The  stationary  distribution  vector  of  the  r(t)  process  is  given  by 

[=.,  |i 

Given  the  logarithmic  norm  associated  with  the  induced  matrix  norm  ||  •  ||i  and 
Li  =  [2,  4],  Z/2  =  (2,  5],  we  have 


eifi(^Ai  —  B\Li)  -f-  e2^(^2  —  .^2^2)  —  — 0.4  >  0 


cipi^Ai  —  B1L2  "b  ^2f^{A2  ~  B2L1)  =  1.2  >  0 
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Thus,  from  Corollaxy  2  it  follows  that  p  >  0.75.  In  other  words,  the  detector  must 
the  detect  probability  p  where  p  >  0.75. 

On  the  other  hand,  this  system  can  be  stochastically  stabilized  by  using  a  non¬ 
switching  feedback  controller.  Let  L  =  [2,  3].  Solving  for  Eq.  (3)  yields  two  symmet¬ 
rical  and  positive  definite  matrices  A/i  and  M2  where 

19  144  J  L  3S  144 

Example  3:  Consider  the  two-form  system  with 
Ar  =  ^,  Bi  =  l,  A2  =  ^,  B2  =  -1 
and  the  generator  of  the  r(^)  process 


Obviously,  this  system  is  not  nonswitching  stochastically  stabilizable.  We  design 
Li  =2,  L2  =  —3  such  that  the  sufficient  condition 

-  BM  -F  ^p{A2  -  B2L2)  =  -|  <  0 

is  satisfied.  Then  we  compute 

q  9  10 

-p(i4i  —  B1L2)  +  -^(.^2  —  B2L1)  ~  ^  ® 

From  Corollary  2  it  follows  that  p  >  |.  Thus,  to  stabilize  the  system,  the  detector 
must  have  the  detect  probability  p  where  p  >  |. 


V.  CONCLUSIONS 

This  paper  considered  the  stabilization  of  stocheistic  hybrid  systems  whose  state 
equations  depend  on  continuous  time  finite  state  Markov  chains.  Non-switching  feed¬ 
back  laws  have  already  been  studied  when  the  controllers  are  allowed  to  feedback  only 
the  system  states.  The  non-switching  gains  are  calculated  based  on  the  sufficient  con¬ 
dition  for  non-switching  stochastic  stabilizability.  Furthermore,  linear  feedback  laws 
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with  the  practical  detectors  are  proposed.  The  range  of  the  detection  probability  of 
the  detector  is  computed  according  to  the  sufficient  conditions  for  the  almost  sure 
exponential  stability  of  the  closed-loop  systems. 

Additional  work  remains  to  be  done  in  stabilizing  singularly  perturbed  stochastic 
hybrid  systems  which  have  been  studied  in  [13,14].  The  results  presented  in  the  paper 
provide  a  initial  step  in  facilitating  the  work. 

Finally  these  results  may  be  extended  to  the  optimal  control  problems  for  singu¬ 
larly  perturbed  stochastic  hybrid  systems. 
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ABSTRACT 

Canonical  forms  for  discrete  linear  N-periodically  lime-varying 
(LP)  completely  reachable  systems  i(<:)  =  Ai,x{k) 
and  y(k)  =  Ckx{k)  -r  Diu{k)  are  presented,  which  generalize 
the  linear  time-invariant  (LTI)  case.  The  derivation  is  first 
accomplished  through  an  equivalent  LTI-quadruple  to  4A'-tuple 

. n-i-  This  LTI  system  is  revealed  to 

be  a  subcomponent  in  a  decomposition  of  a  given  discrete  LP 
system  represented  by  the  4IV-tupie.  Finally,  an  application 
of  the  obtained  canonical  forms  is  demonstrated  in  a  control 
problem:  eigenvalue  assignment  of  the  monodromy  matrix. 

I.  INTRODUCTION 

Linear  periodically  time-varing  (LP)  systems  have  been 
studied  by  many  researchers  (6,  and  references  therein).  LP 
systems  are  suitable  models  for  some  periodic  behaviors  such  as 
seasonal  phenomena  and  rhythmic  biological  movement.  It  has 
also  been  noted  that  LP  controllers  give  linear  time-invaraint 
(LTI)  plants  more  robust  control  in  the  maximum  attainable 
gain  margin  sense  |l][3|. 

Motivated  by  the  above,  we  have  investigated  more  pre¬ 
cise  mathematical  descriptions  of  LP  systems  j7jl8l.  In  this 
work,  we  primarily  show  a  state-space  canonical  form  for  dis¬ 
crete  LP  systems  x(k-f-l)  =  Aix(k)~  Bkii{k),  y(k)  =  Ckx{k)  + 
Di,u(k)  wWe  the  quadruple  is  N-periodic  (A»,.B,,Ct,I5»)  = 
{Ak^N^Bk+N,  As  for  LTI  systems,  we  beiieve  that 

the  canonical  forms  for  the  discrete  LP  systems  play  the  same 
role  in  such  problems  as  realization,  control  and  identification. 

In  section  11,  a  reachable  canonical  form  is  derived  us¬ 
ing  an  equivalent  quadruple  to  a  4A'-tupie  [{Aq,  Bo,  Co,  Do), 
(Ai,Bi,Ci,Di],...,(Afi^i,Bn-i,Cf,-i,Ds-i))-  In  section  III, 
a  system  decomposition  S’L,S  is  derived  for  a  given  discrete 
LP  system,  and  the  equivalent  quadruple  is  revealed  as  the  re¬ 
alization  matrices  for  the  LTI  system  L,.  In  section  IV,  a  typ¬ 
ical  application  is  demonstrated  using  the  reachable  canonical 
form  along  with  the  stability  analysis  and  feedback  connection 
rule  developed  in  section  III:  eigenvalue  assignment  of  the  mon¬ 
odromy  matrix  for  a  completely  reachable  discrete  MIMO  LP 
system. 

Throughout  this  work,  LP  systems  are  assumed  to  be  of 
dimension  n  with  m  inputs,  p  outputs  and  period  N  such 
that  (At,B*,C»,Ut)  e  R"-"  x  R’”'"  x  R”*"  x  R''""*  for 
k  €  {0,1,..., A  —  1}.  Bold  face  characters  are  reserved  for 
matrices  and  vectors  of  ’big’dimension  (a  multipie  of  A')  such 
as  (A„B.,C,.D.)  €  x  iZ*"*^”*  x  x 

and  r(Jk)  €  R^‘.  The  superscript  is  used  to  indicate  a  set 
such  as  {0,1, 2,..., A'  -  1/  =  A”. 


n.  STATE-SPACE  CANONICAL  FORMS 

Before  deriving  the  canonical  forms,  let  us  recall  the  sig¬ 
nificance  in  general  setting.  Let  an  'objective'  function  F  on 
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a  parameter  set  X  be  given.  In  many  situations,  the  function 
F  is  many-io-one  so  that  we  can  find  an  invariant  partitioning 
{A,  C  X  :  U.e/  A’,  =  X  for  some  index  set  1,  X.DX,  =  d  for 
»  }} 

D{2)  =  F[y)  <=  x,yiX, 

The  parameter  set  X  is  then  too  ’redundant’  with  respect  to  the 
function  F.  Therefore,  it  is  natural  to  select  a  representative 
or  a  canonical  element  i,  6  X,  for  the  subset  A',.  This  selecting 
process  would  be  understood  as  a  map  A  from  X  into  X.  Such 
map  A  is  called  a  canonical  form  for  the  parameter  set  X  under 
the  partitioning  {X,  :  i  €  /}  with  respect  to  the  objective 
function  F.  The  function  R  on  A  is  ’simplified’  by  restricting  its 
domain  to  the  subset  {z,  :  i  €  /}  without  missing  the  original 
objective 

D  =  Rile,)  h 

If  such  parti. ioning  is  induced  by  an  equivalence  relation  R, 
the  objective  function  R  and  a  canonical  form  A  are 

■fW=R(y)  ■<=  zRy  (1) 

A(z)  =  A(y)  o  X  By  (2) 

A(i)  R  I  (3) 

More  specifically,  consider  a  parameter  space  {((A*,B*,  C*, 
Dt))kef/-  :  ((Ae.Bt))*6iv  is  completely  readable ^  an  equiv¬ 
alence  relation  Rp  and  an  objective  function  E  where 

(i)  E  is  the  vector  valued  puise  response  or  its  one-  sided  z- 
transformation  of  a  discrete  linear  periodically  time-  varj'ing 
(LP)  system  LR((A*,R*,C*,i?i))*6^. 

E(z,i)  =  |C,A,-.i  •••AoAm.,  -1- 

C,-\A,-j  ■  ■  ■  AoAs-i  • '  •  C,^.\z  *]  • 

(z''7  -  A  -  •  •  AoAs-i  ■  •  •  A,+i)"'£.=~’  -b 

Az-'  for  i  €  A'*  (4) 

LP[{At,B,,C.,D,)),^s-  :  «(*)  -  y(<:) 

x(k  -r  1)  =  Akx(k)  -r  Beu(k),  y(k)  =  Ctx(k)  -i-  Dku[k) 

(Ak,Bk.Ck,Dk)  =  {At^.n,Bk~ff,Ck*N,Dk+r/)  for  all  k  (5) 

(ii)  [{Ak,Bk)]kerf  \s  called  completely  reachable  if  each  reach¬ 
ability  matrix  R,((Ai,£*))*ew-  for  i  €  A"  is  full  rank 

Ri[{-Ak,Bk))keff'  =  AiBi-i  AiA,-\Bi-i  •••]  (6) 

This  matrix  relates  the  state  zi't-r  1)  to  past  inputs  u(k),  k  <  i. 

(iii)  i{Ak,Bk,Ck,Dt])keff  Bp  ((Ai,Bi,C;,i?i))»eN-  iff  tnere 
exist  nonsinguiar  matrices  (TD.e//-  such  that 

AkTk-i  =  TkA'k,  Bk  =  TkB'k,  CkTk-i  =  Cl,  Dk  =  D'k  (7) 

Note  that  E  is  invariant  under  Rp  as  in  (1).  Now,  we 
intend  to  derive  canonical  forms  {A}  as  in  (2)  and  (3)  for 
{((A*,B*,Ct,Z?i))*gjy.  :  ((A*,Bt))*6^,.  is  completely  reach¬ 

able  }  in  (6)  under  Rp  in  (7)  with  respect  to  B  in  (4).  Since 
a  system  (S)  is  involved,  such  A  is  simply  called  a  reachability 
canonical  form  for  discrete  LP  systems. 

In  our  derivation,  it  will  be  very  convenient  to  con¬ 
sider  an  equivalent  quadruple  (A,,B,,  C,,D,)  to  a  4A’-tuple 
\XAk,Bk,Ck,Dk))k^N-  induced  by  a  map  E 

E  :  ((A*,R*,C*,D*)).6„.  -  (A.,B.,C..D.)  (8) 
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The  map  E,  which  is  called  hereafter  the  extended  form,  pre¬ 
serves  the  complete  reachability  of  {{Ai,,  Bt))ien’  in  (6)  and 
the  equivalence  relation  Rp  in  (7)  in  the  following  sense. 

LEMMA  1:  ((A*, B*))k€/#-is  completely  reachable  iff 

(A,,B,]is  reachable. 

PROOF:  Observe  the  reachability  matrix 

R(A..B.)  =  [B.|A.B.l.--|A'^"-‘B.i  (9) 

After  each  submatrix  A'.B,  for  I  6  (iVn)’  is  block-diagonalized 
by  a  column  change  operation,  the  whole  matrix  can  be  block- 
diagonalized  by  another  column  change  operation 

ii:(A,,B,)T'j  =  diag(  [Bo  A^Ah-iBs-j  •••!, 

[Bi  AiBo  AiAoBs-i 

[Bn-i  As-iBs-t  An-iAs-jBs-z  •••])  (10) 

where  T,  denotes  those  column  change  operations.  Since 
each  diagonal  component  in  (10)  is  the  reachability  matrix 
B.((Ak,B»))i6Ar-  defined  in  (6),  we  proved  the  lemma. 

LEMMA  2:  Let  Rc  be  an  equivalence  relation  defined  by: 
(A„B.,C.,D.)  Re  (A'„B'.,C;,D'.)  iff  there  exists  a  diagonal 
nonsingular  matrix  To  =  diag  such  that 

A.Tn  =  TDA'.,  B.  =  TdB'.,  C.Td  =  C'.,  D.  =  D'.  (11) 

Then, 

((Ak.Bk.C*,!?*))*^^-  Rp  ((A^,B^,C[,D[])keS’ 
o  (A.,B..C..D.)Be(A;,B:,C:.D;)  (12) 

PROOF:  By  direct  calculation  of  (11),  we  obtain  (7).  ///// 
For  showing  our  main  result,  let  (A,,B,,  C,,D.)r  and 
Ak,Bk,Ck,Dk))kr  denote  (A„B„C„D,)  and 
Ak,Bk,Ck,Di,))keH'  of  which  (A.,B.)  and  ((A»,B*))k€;/-  are 
respectively  reachable  and  completely  reachable. 

THEOREM  1:  If  a  map  F,  is  a  canonical  form  for 
{(A,,B., C,,D«)r}  under  Re,  then  a  composite  map  E~^T,£ 
is  also  a  canonical  form  for  {((Ak,Bk,Ck,Bk))cr}  under  Rp. 
PROOF:  (i)  The  map  £'”‘r,B  should  satisfy  (2). 

((A*,Bk.C*,i7k)).,  Rp  {[A'k,B'k,Ck,D'k)U 
o  (by  LEMMA  2) 

B((A»,Bk,Ck.Bk))„  Be  b((a;,b;,  c;,Bi))„ 

o  (since  both  sides  of  the  above  are  reachable  by 
LEMMA  1,  and  F,  is  a  canonical  form  as  in  (2)  ) 
F.B((Ak.B.,Ck.Bk))„  =  F.i:((A;,B;,c;,B'j)„ 

B-*r.B((Ak.Bk,Ck,B*))„  =  B-‘r.£((A;,B;,c;,B;))„ 

(ii)  The  map  B"‘F,£  should  satisfy  (3). 

(since  F,  is  a  canonical  form  as  in  (3)  ) 

F.(A.,B.,C..D.),  Re  (A.,B., C.,D.), 

=>  (by  the  definition  of  E  in  (8)  and  LEMMA  1) 
F.B((Ak.Bk.Ck.Bk))„  Be  B{(A*,Bk.C*,Bk))., 

=>  (by  LE.MMA  2) 

=>  E  'F,B((Ak,  Bk,  Ck,  Bk))cr  Bp  ((Ak,  Bk,  Ck,  Bk))cr 

We  proved  the  theorem.  ///// 

Although  it  is  relatively  easy  to  find  a  canonical  form  F 
of  E[(Ak,Bk,Ct,Dk))ksf/-  such  as  the  ’reachability’  canonical 
form  with  the  Scheme  II  l2j  (since  the  notions  of  reachability 
and  controllability  are  different  for  discrete  time  systems),  we 
need  to  find  a  canonical  form  F,  which  is  of  the  extended  form 
as  in  (8).  This  is  because  we  can  apply  the  inverse  map  B“*  to 
obtain  a  canonical  4iV-tuple  according  to  THEOREM  1.  Let 


us  introduce  such  extended  canonical  form  Fi,.,,  and  hence  a 
reachability  canonical  form  B~‘Fa„,,£: 

((A  Bk,Ck,Bk))kew  ((Ajc.k, Bi„,k, CjM.k, £i.»,k))k6/»*(13) 

which  is  called  hereafter  the  Kronecker  canonical  form. 

ALGORITHM  for  the  KRONECKER  CANONICAL 
FORM: 

(i)  Construct  (A,,B,)  by  £  for  a  given  {(Ak,Bk))k€H‘ 
is  completely  reachable,  and  search  independent  columns  in 
B(A,,B,):  Recall  that  each  column  of  B(A,,B,)  corresponds 
to  each  cell  of  the  following  Crate  Diagram,  e.g.,  for  the 
case  n=5,  m=2  and  N=3.  Observe  that  all  columns  of 
B,((Ak,Bk))k6/v-  (with  the  proper  zero-augmentation)  are  cor¬ 
responding  to  i-numbered  cells.  See  (6)  and  (10). 


(14) 

More  specifically,  the  m  cells  of  Aj'”’*  [0  •  •  •  0  Bj  0  •  •  •  Oj^  are 
equal  to  (A,  •  •  •  AtjAn-i  •  ■  •  Ak,!)' A,  •  •  •  AoAy.i  •  •  •  A,viB<  with 
the  proper  zero  augmentation  for  all  r  6  »*,  s  6  N'  and »  £  m*. 
The  search  procedure  is  principally  the  same  as  for  the  usual 
Scheme  H,  from  left  to  right  until  all  Nn  indepradcnt  cells  are 
founded,  except  that  the  dependency  test  for  an  i-numbered  cell 
only  needs  to  be  done  regwding  to  all  previous  independent  >- 
numbered  ceils.  Once  a  cell  turns  out  to  be  dependent,  set  up 
the  dependency  equation,  which  will  be  utilized  in  step  (iii). 
The  positions  of  dependent  cells  would  be  marked  by  big  O  as 
in  (14).  The  positions  are  actually  specified  by  the  reachability 
index  (ki,lc],...,l;(v,,),  for  an  instance,  (2, 3, 2, 3, 2, 3). 

(ii)  The  ordering  of  independent  cells  is  now  chosen  differently 
from  the  Scheme  H,  so  that  the  resulting  matrix  is  diagonalized. 
The  procedure  is  as  follows:  Order  ail  i-numbered  independent 
cells  to  form  a  matrix  such  as  for  t  =  0 

To  =  [[Boil  Ao[By_i]i  •••[[Boll  Ao[B/v-i]j  ■••] 
where  the  subscript  1  or  2  of  [Boj,  for  instance,  stands  for  the 
first  or  second  column.  Form  a  transformation  To 

To  =  diag  (ri).6iv.  (15) 

(iii)  Find  an  extended  canonical  quadruple  F zto.t 

Fjkk,#  :  (A,,  B,,  C,,  B,)  ►  (A3kk,k,Bikk,k,  )  (16) 

A,To  —  ToAjkk.k  B,  —  ToB]kk,f  C,To  —  D,  =  Bie*,, 
For  obtaining  Aju..,  utilize  the  dependency  equations  found  in 
step  (i). 

iv)  Applying  £”*,  we  finally  obtain  the  Kronecker  canonical 

(Ajak.k  .Bju.k  ,Cztm.k  ,  £]<k.k)  )kEA^' • 

REMARK  1:  In  step  (i),  we  should  always  find  n  indepen¬ 
dent  cells  in  ail  t-numbered  cells  for  each  i  €  N’ .  To  show  this, 
rewrite  the  corresponding  reachability  matrix  B,((Ak,Bk))k€:v- 
in  (6) 

B,((Ak,Bk))kE;r.  =  |VV.  ♦,W.  •  •  •] 

W,  =  [B,  I  A,B,-i  I  —  I  A  ■  •  •  AoAk^-i  •  •  • 

♦,  =  AA-i  •  •  ■  AoA/r-i  •  ■  ■  A<.i  (17) 
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By  usuxnption  of  the  complete  reach»bilit)',  is  full 

rank  for  each  i,  and  the  number  of  independent  colunms  is 
now  the  dimension  n  of  the  monodromy  matrix  In  step 
(ii),  we  might  order  independent  cells  according  to  the  order¬ 
ing  procedure  of  the  usual  Scheme  II.  The  resulting  transfor¬ 
mation  would  yield  the  reachability  canonical  form  with  the 
Scheme  II  (Ajo.Brot  Cjc.D}.,),  which  is  similar  to  the  canon¬ 
ical  quadruple  (A]..,,  .B)..,,  in  (16)  by  column 

and  row  permutation. 

EXAMPLE  1:  is  arbitrary,  and 

((A*,B*))»gy  is  given  as 


(Ao,Bo)  = 


48  192  -4  -42 

3  12 

-16  -63 
-22  -88 


30 

0-1  4 

0  6  -16 
1  15  -19 


-1 

0 

0 

0 


-11  n 
0 
1 

4 


VL 

10 

40 

0  -4 

12  J 

L 

0 

-ij 

J 

•  0 

-3 

-4  -1 

-8 

f  ° 

1 

0 

20 

-2  -4 

-19 

0 

0 

(Ai,Bj)  — 

1 

36 

-13  -8 

-56 

1 

-2 

0 

0 

21 

9  -1 

5 

-1 

-2 

1 

0 

-38 

7  9 

<<5  , 

0 

-1 . 

; 

f 

-124  - 

■155 

10  -21 

-78 

0 

0 

N 

23 

28 

-2  4 

14 

0 

0 

(A,,B,)  — 

327 

467 

-22  44 

239 

t 

-1 

4 

— 

85  - 

126 

5  -10 

-65 

0 

-1 

1 

7 

13 

0  0 

7 

.  0 

0 

J 

(i)  Searching  dependent  columns: 

Jlol(Ak,Bt))k€3‘  —  |jBo  I  AcB]  I  AoAjBj  |  A0A2A1B0 


* 

-1  -11 

4 

26 

48  0 

•  -106 

.  . 

0  0 

0 

1 

3  0 

-5 

.  * 

ss 

0  1 

0 

-6 

-16  1 

•  24 

-  . 

0  4 

-1  - 

11 

-22  0 

46 

0  -1 

0 

4 

10  0 

•  -18 

•  • 

■Ri((A*,B*))*6j.  *  [Bj 

1  A) Bo  1  AiAoBj  j  AiAoAjBj  | 

•  0  1 

0 

0 

1  0 

.  -4 

•  •  1 

0  0 

0 

1 

4  0 

•  -2 

•  • 

— 

-2  0 

-1 

0 

12  4 

.  -13 

•  • 

-1  -2 

0 

0 

1  -2 

9 

•  • 

.  0  -1 

0  - 

■2 

-9  1 

7 

(18) 


(19) 


ifj((As,B*))*cj-  =  [Bj  I  AjBi  I  A2A1B0  I  A}i4iAojBs  1  •] 


•  0 

0 

1 

--4 

-10 

1 

4 

.  .  ’ 

0 

0 

0 

1 

2 

0 

•  -2 

.  . 

-1 

4 

0 

0 

22 

-11 

•  63 

.  . 

0 

-1 

0 

0 

-5 

4 

•  -25 

.  • 

0 

0 

0 

0 

0 

-1 

•  7 

•  *  ^ 

(20) 


The  reachability  index  turns  out  to  be  (2, 3, 2, 3, 2, 3).  Setting 
up  the  dependency  equation: 


AoAj[Bi]i - ^l^oli  ~  4-  2[Bol}  4-  3Ao|B]]j(21) 

AoAjAilBoj]  =  2[Bo]i  t  ~  2[Bo]r  —  SAolBjji 

-AAcAilBij,  (22) 

AiAo[Bj]i  =  — 3[Bj]i  —  6.di[BoIr  +  \Bi]2  ■r4Ai[Bo}}  (23) 
AiAoAilBijj  =  [Biji  T  ”Ai[Bo]i  ~  —  2Ai{£ol2 

— AiAo[Bj)r  (24) 

A, AiIBoli  =  -2(5, ]i  -  2^, [Bill  5[B,1,  -f  2A,[B, 1,(25) 
A,AiAo[Bj]j  =  2[B,]i  -r  3.4,|Bi]i  —  3[B,]j  —  2A,(Bij, 


— 7X,Ai(Bol, 
(ii)  Ordering  independent  columns  for  each  i: 


(26) 


To  =  [  (Boll  Ao  (B,)i  1  [BoJ,  Ao  (B,j,  AoA,  [Bi],  ]  (27) 

Ti  *  [  (Bi]i  Ai  [Boji  I  [Bi]j  Ai  (Bo]j  AiAo  (Bjj,  ]  (28) 

T,  =  1  lBj]i  Aj  |Bi)i  I  [Bjjs  A,  (Bi),  A,Ai  [Bo],  ]  (29) 


(Hi)  Form  T©  in  (15)  with  (27)  to  (29).  Applying  (16)  and  uti¬ 
lizing  (21)  to  (26),  we  obtain  (there  is  no  need  for  numerically 
calculated  Tn  yet) 

A,<..,  = 

0  -4  0  0  2  ' 

1-300  1 

0  200-2 
0  310-5 

0  001-4 

0-300  1 

1-600  7 

0  100-4 

0  410-2 

0  001-1 

0-200  2 
1-200  3 

0  500-3 

0  210-2 
0  001-7 


f 

•  1  0  • 

10 

10' 

0  0 

0  0 

0  0 

Ba„,.  =  diag 

0  1 

t 

0  1 

1 

0  1 

0  0 

0  0 

0  0 

1 

.00. 

.0  0. 

.00. 

; 

Calculate  Tn  in  (IS)  with  (27)  to  (29)  and  (18)  to  (20).  Ap¬ 
plying  (16),  we  obtain  C,..,.  and 
(iv)  Read  out  nonzero  block  in  (A,«.,.  ,Bju,. 
according  to  B“*,  and  we  finally  obtain  ((A,^.*  ,B,„.i  , 

B2t,4))keN‘- 

In  fact,  each  value  of  a  canonical  form  is  completely 
specified  by  two  things,  a  structural  index  k  and  a  para¬ 
metric  quantity  8:  k  tells  which  positions  of  all  matrices 

B“*r,M.,B((At,B»,  Ck,  Dk))tes'  to  0  or  1,  and  8  gives 

real  values  for  the  nonfixed  positions.  In  the  canonical  quadru¬ 
ple  (A,u,t  ,B,«,,,  ,  of  EXAMPLE  1,  all  entries 

of  (Caw.«  *1^0  arbitrarily  determined,  and  the  invariants 

are  only  found  in  (A,*.^  «Bte>a)  i&  (30).  The  ordered  set  of  real 
numbers  on  the  non^ed  positions  in  (A,m,,  tBa«,<)  ut  (30)  and 
the  set  of  indices  indicating  the  fixed  0  and  1  entry  positions  are 
respecively  values  of  the  parametric  complete  invariant  8  and 
structural  invariant  k.  Specifically,  since  the  reachability  index 
(2, 3, 2, 3, 2, 3)  determines  the  fixed  0  and  1  entry  positions,  we 
can  write 


«:ru...(A.,B.)  =  (2,3,2,3,2,3) 

The  range  space  of  k,  however,  is  a  proper  subset 

Urn. 

*r,„.,{(A.,B.)}  C  {(ki.k, . ksm  :  IT  } 

J«X 

An  index  (2, 3, 2, 3, 1,4),  for  instance,  does  not  correspond  to 
K  for  reachable  {(A«,B,)}  in  t)ie  case  n  =  5,  m  =  2  and 
A*  —'3,  since  there  are  respectively  4A  and  6  independent  ceiis 
(columns)  in  groups  of  0,  1  and  2  numbered  cells  (in  matrices 
of  iZD((At,Bt))tgjv-,  i2i((At, Bt))se/f  and  B,((At,Bi))te^-), 
which  violates  the  assumption  of  the  complete  reachability  as 
mentioned  in  REMARK  1.  The  parametric  complete  invariant 
8  is  indeed  the  ’simplified’  parameter.  All  quadruples 
{(A„B„C.,D,),}  (and  hence  {((A*,Bt,C»,B»))„})  »«  P»- 
rameterized  by  x  and  8. 


m.  A  SYSTEM  DECOMPOSITION 

In  the  foregoing  section,  the  extended  form  (A.,B,,  C,,D.) 
played  a  key  role  in  the  derivation  of  the  canonical  form  for  dis¬ 
crete  LP  systems.  Now,  a  question  naturally  arises  what  the 
relationship  is  between  the  discrete  LP  system  LP[[AK,Bt,  C/,, 
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1 


Dk))ieff'  ind  the  LTI  one  C.,D.)  for  which  the  re- 

ailizAtioa  matrices  are  of  the  corresponding  extended  form.  We 
investigate  this  through  asystem  decomposition.  Let  M,,  t{k\, 
and  Q(/c)  denote  respectively  a  nonsingular  Nj  x  Nj  matrix, 
a  concatenated  vector,  and  a  diagonalized  matrix 

I,  ■ 

M,  =  .  (31) 

'  A  . 

t(k)  =  Ir^ (fc  -  iV  +  1)  . .  •  r^{k  +  1)  (32) 

Q(ifc)  =  diag  (t3(i:  -  iV  -  1) . Q[k-r  1),  Q(fc))  (33) 

Observe  that  M,  is  the  generator  of  a  cyclic  group  of  order  N. 
THEOREM  2;  A  discrete  LP  system  in  (5)  is  decomposed 

as 

LP{{A^,B^,Ct,n,))kis-  =  5*i(A.,B.,C.,D.)5  (34) 

where,  using  (31), 

S  :  u{k)  ^  v(fc):  v{k)  =  Ml;:'  u(fc)  (35) 

L(A.,B.,C.,D.):v(fc)  —  ^(k) 
z[k  -r  1)  =  A.z(fc)  -i-B.v(fc),  w(;c)  =  C.z{k)  +  D,v(fcj  (36) 

S' ;  w(Jt)  y(ik);  9(k)  =  w(fc)  (37) 

REMARK  2:  Let  i(A„B.,C„D,)  be  simplely  L„  and 
called  the  extended  invariant  system  of  Lf’((At,Bk,  C*, 
Dk]]k€S’‘  The  extended  invariant  system  L,  is  clearly  different 
from  the  other  invariant  models  [3][S|,  since  its  derivation  does 
not  appeal  to  the,  so  called,  lifting  operator  jSj.  For  a  given 
LP((A»,B»,C»,i?»))t6Aft  the  composite  system  S'L,S  indeed 
gives  its  l/O-equivalent  model  at  every  instant  of  time,  and  still 
L,  pertains  to  the  stability  analysis  and  design  as  shown  later. 

PROOF:  By  stacking  the  equations  in  (S)  for  times  {k  — 
JV  +  1,  ...  ,  /fe  +  1,  k),  we  obtain  another  LP  system 

Ii’((A»,B.,C.,fi»)):a(l:)  «  y(k) 
x(*  + 1)  =  A»Ji(ife)  +  a»a(fc),  ?(k)  =  C»x(fc)  +  fita(fc) 

(Ak,Bt,  CtflDt)  =  (A*.,./(f,fi*+Ar,  C»+Ar.D*+//)  (38) 

Consider  the  following  transformati 

v(ik)  =  M!L"-‘tl(ifc).  t(k)  =  M;3t(k),  w(fc)  =  M;*‘y(fc)  (39) 

By  applying  (39)  to  (38),  we  obtain  a  linear  system  with  the 
quadruple 

(Mr‘A*M;‘, 

m;*‘d*m;'‘"‘>)  (40) 

Observe  the  identities 

m;‘a*m,  =  A*.,  =  Bk*, 

M;‘CkM,  =  C.*,  =  0».k,  (41) 

Applying  (41)  to  (40),  the  quadruple  in  (40)  becomes 
(A.,B„C,,D,)  as  in  (8),  and  the  transformed  system  is  the 
LTI  system  L(A.,B.,C,,D,)  in  (40).  Moreover,  the  input  and 
output  traiuformation  in  (39)  yield  S  and  S'  in  (35)  and  (37), 
respectively.  ///// 

LEMMA  3:  A  discrete  LP  system  LP((A»,  Bk,  C*,  Dk))teff- 
in  (S)  is  asymptotically  stable  iff  its  extended  invariant  system 
L(A,,B,,  C,,l),)  in  (36)  is  asymptotically  stable. 


PROOF:  From  (39),  the  state  x(k)  of  LF  is  related  to  the 
state  t(k)  of  L, 

t{k)  =  M*  ir’’(fc  -N  *  1) . z^[k  -  l),z^(fe)!’' 

Due  to  the  boundedness  of  Mj,  [|  t{k)  |j  —  0  as  k  — •  oo  iff 
II  x{k)  II  —  0  as  k  — >  00.  Moreover,  since  S'  has  no  internal 
state,  and  ail  states  of  5  go  to  zero  for  the  zero  input. 
LEMMA  4:  The  following  identity  holds 

p.(z)  =  d«t(z/-  A.) 

=  det{XI  -  Ai-i  •  •  ■  Afi-iAo  •  •  -  A,)  =  p(A),  for  A  =  r''  (42) 


PROOF:  By  direct  cadculation  of  det(z/  —  Af,7’A,Afi). 
The  subsystem  of  S'L,S  have  unique  properties  whicn  are 
useful  for  the  feedback  connection. 

LEMMA  5:  The  following  identity  holds 

SS'L.S  =  L,S  (43) 

PROOF:  Let  u(k),  y{k)  and  r(Jc]  be  the  input  and  output 
of  S'L,S,  and  the  output  of  SS‘L,S,  respectively.  Then, 

y(k)  =  u{li:)  by  (37) 

r(k)  =  Sy(k) 

=  M;*‘y(k)  by  (35) 


Therefore,  r(ifc)  =  L,Sti{k),  which  means  the  identity  (43). 

LEMMA  6:  Assume  the  dimensions  of  all  subsystems  are 
compatible.  The  parallel  (addition)  or  serial  (multiplication) 
connection  of  the  extended  invariant  systems  as  in  (36)  again 
yields  an  extended  invarint  system. 

PROOF:  Let  (Aj,Bj,  Cj.Di)  and  (AJ,BJ,CJ,DJ)  denote 
the  relizations,  and  (v*(il:),z‘(k),w^(lE))  and  (v*(k),z*(k), 
w*(k))  variable  sets  of  the  input,  state,  and  output  of  the  two 

extended  invariant  systems  L,i  and  La. 

(i)  Let  v*(k)  =  v*(k)  =  v(ik),  w‘(k)  =  w*(k)  as  w(k),  and  the 
parallel  connected  system  is 

[r’(ife  +  l)]-[  0  A;J[z’(k)J  •  [bJJ^W 

w(k)  =  [cj  c;][‘!|j^j]^[D;^Dj]v(k)  (44) 

The  quadruple  of  (44)  is  not  of  the  extended  form.  But  by 
transformation  Tn 


■/„  0  0  0 

0  0  0  •••  0  ■ 

0  0  0  •••  0 

/„  0  0  •••  0 

0  0  •••  0 

0  0  0  0 

0  0  0  0 

0  0  0 

0  0  •••  0 

0  0  0  0 

0  0  0  •••  0 

0  0  0  7„. 

where  rti  and  rij  are  dimensions  of  L,i  and  La,  the  quadruple 

is 

(ii)  Let  v’(k)  =  w*(Jk),  v*(k)  =  v(fc),  and  u»’  =  ■w(k),  and  the 
serial  connected  system  is 

*'(*  +  l)l_f  Ai  0  ]  r  z'(k)  ]  r  BJ  1 

z’(k  + 1)  J  “  [  B?CJ  a;  z'(k)  ■  BJD) 

w(k)  =  [DjCi  CJ]  Jijj;}  ^[D|Di]v(k)(46) 

Again,  by  Tr  in  (45),  the  quadruple  is 

^  ((  BlCl  Al  '  B%1  '{ 


THEOREM  4:  The  following  two  feedback  systems  are  the 

same 


iSj  5,*  ryiSi  plL.i^S, 


1  Sj  pU..3-T - iitl'  !  1‘S'r  [ 


^ — LL.j— 


PROOF;  By  using  LEMMA  S  and  6. 

IV.  APPLICATION  TO  EIGENVALUE  ASSIGNMENT 

A  typical  control  application  of  the  canonical  form  derived 
in  section  II  is  the  eigenvalue  assignment  of  the  monodromy 
matrix  4o  (I^)  ^  completely  reachable  discrete  MIMO 

LP  system 

x(k  +  1)  =  A*i(Jt)  +  Btu(k),  v(k)  =  /.*(*)  (47) 

where  is  a  n  x  n  identity  matrix.  The  eigenvalue  assign¬ 
ment  problem  is  stated  as:  By  what  cyclo-static  state  feedback 
u(k)  =  Lkii(k)  is  the  monodromy  matrix  4a  of  the  plant  (47) 
controlled  to  =  (-dA^-i  ~  5^_iL/^_j)  •  •  •  (Ai  —  H|L|)(Ao  — 
BoLo)  such  that  for  a  given  desired  polynomial  p(A) 

det(A7  —  {As-i  ~  Bs-\Ls-i)  •••  (Ai  —  BiLi){Ao  —  BoLq}) 


= pw  m 

Although  the  eigenvalue  assignment  problem  has  been 
solved  [4],  our  approach  using  the  canonical  form  is  simple  and 
generalizes  the  time-invariant  case.  The  solution  is  eventually 
obtained  in  terms  of  the  cyclo-static  state  feedback  (I<s)se^* 
and  input  transformation  (Gt)terr* 

u(Jfe)  =  G*r(A:)  -  L*y(fc)  (49) 

However,  the  problem  is  first  considered  through  the  extended 
invariant  systems:  The  controlled  system  in  (47)  and  (49)  can 
be  understood  as  in  THEOREM  4  where 

LPi  =  LPl(At,Bt,In,0))kf/f’,  LPi  =  XrP((0,0,0,I<*))*esr-, 
and  ZPi  =  LP((0,0,0,Gt])ke/f'  The  overall  extended  invari¬ 
ant  system  Za(l  -r  b 

Xis(l  -r  L,iXi,j)~*L,i  =  (A,  —  B,G,G,  ‘L,M«,B,G,,Mn,0) 

(50) 

Let  the  characterbtic  polynomial  of  the  controlled  extended 
invariant  system  matrix  A,  —  B,L,M„  be 

de:  (zl  -A.-h  B.L.M„)  =  p'(z)  (51) 

then,  by  applying  LEMMA  4  to  (48)  and  (51), 

p'(z)  =  p(\)  for  A  =  (52) 

Therefore,  the  problem  b  solved  by  finding  L,  in  (51)  with 
proper  G,  for  p'{z)  given  from  p(A). 

Without  loss  of  generality,  we  show  the  procedure  for  finding 
L,  and  G,,  and  hence  (Lt)se/v  and  [Gk)kefi‘  '''■ih  an 

example. 

EXAMPLE  2:  A  completely  reachable  LP  system 
LP((Ak,B^,In,0))keN'  as  in  (47)  of  n  =  3,  m  =  2  and  =  2  b 
given.  Let  the  extended  invariant  system  be  L(A,,B„Mn,0) 
where  M„  b  as  in  (31). 

(i)  Using  the  ALGORITHM  in  section  H,  find  the  extended 
canonical  quadruple  of  (A,,B„Mn,0).  Let  us  assume  its 
structural  invariance  (reachability  index)  b  <c  =  (5,0, 1,0) 


I 

0 

® 

1 

[® 

A. 

1 

1 

® 

0 

Aj 

0 

0 

1 

1 

AJ 

1 

1 

0 

0 

AJ 

0 

0 

Ll! 

1 

a; 

1 

0 

Ll 

The  following  Tn  yields  the  extended  canonical  quadruple 
(A}M,>>B]M^,M„Tn,0)  of  (A.,B,,M„,0) 


Tp  =  diag  (To.Ti) 


(53) 


[lA 


ifili  X«A|  A^AtA^At  {B»]i 


1^1 


0  0 

A|A«A|  (Bo) 


.] 


(*A.i«»,i,Bj**^)  — 


/ 

00,0  0  0 

1  io.o 

'i 

0  10 

0  0 

0  0  1 

0  0 

0  0  Uo.i 

» 

1  ^,1 

1  0  Oj.i 

0  0 

V 

.  0  1  oj.i 

0  0. 

y 

(54) 


(ii)  The  following  Tua  of  Q  yields  the  usual  reachability  canon¬ 
ic^  form  with  the  Scheme  H  (Aj^Bj^,  MnTr>Q,0)  of 
(A.,B.,M,,0) 


(55) 


[fieli  0  (Aeli  0  a#Xia»a,  |n,li  o 

0  Ai  |Aa]i  0  XiX*Ai  [fieli  0  Ifilll  , 


T,e,  =  TpQ 


Q  = 


r  1 
0 
0 
0 
0 
0 


(56) 


(57) 
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0 
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■  1 
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1 
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0 

®J.l 

0 
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0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

,  0 

0 

0 

0 

«0.1 

0  J 

,  0 

0 

1 

4o.i  . 

y 

(58) 


(iii)  By  the  algorithm  j2,  pp.  434-437],  we  ob¬ 
tain  the  transformation  It  and  controller  canonical  form 
(A3„Bu,M,.TnQE.,0)  of  (A,.B..M„.0)  (see  the  derivation 
in  the  APPE-NDIX) 

0 


R  = 


1  0  -oj.j 

0 
1 
0 
0 
0 


0  1 
0  0 
0  0 
0  0 
0  0 


-Oj.I 

0 

1 

0 

0 


-Oi.i  0  ) 
0  0 
-oj.i  0 
0 
1 
0 


0 

0 

1 J 


(59) 


(Aj„Bj,) 


/ 

■  0 

flj.l 

0 

“1.1 

0 

“0,0  ' 

boss 

0 

0  ■ 
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0 

0 

0 

0 

“0.1 

0  . 

,  0 

0 

1 

fco.i  . 

y 

(60) 
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(iv)  AMume  th«t  &  desired  characteristic  polynomial  is  given 
u  p{X)  =  A*  -  p,A’  -piX-po-  Then,  by  (52), 

p'(x)  =  X*  -  Pix^  -piz  ~po 


(v)  Using  the  controller  method  [2,  pp.  500-5031,  we  now  intend 
to  control  the  system  i(Aj,,Bj„hi.ToQH..O)  to 
I(A',..Bi..M,TBQR.O)  (Ai..Bi.)  = 


I 

■  0 

Pi 
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Pi 
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Po  ' 

■  1 
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.  0 
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0 

.  0 

0 
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0 

; 

(61) 


by  the  input  transformation  C,  and  state  feedback 
G7*L,M,TnQR  such  that 


B;.  =  B,.G.,  A^  =  A,.  -  B,.G.G.-'L.M«TnQRA',.  (62) 

(vi)  From  (60)  to  (62), 


■  1  -6qo  0  0 

0  10  0 

0  0  1  -io.i 

0  0  0  1 


(63) 


k,  0  Jk,  OJt,- 
0  0  0  0  0 
0  0  0  ^4  0 
00000. 

where 

fci  =  <*0.0  —  Po*  ^  =  <*1.1  ~  Pi 
hi  —  «},i  -  Ps.  =  uo,j  ~  1 
From  (53),(S7),(59),(63)  and  (64), 


From  (60)  to  (62), 

Gr‘L.M,TnQR  = 


[[‘o'  ‘S 


*1,1^3  +  kj  ] 
0 

0 


0 

0  0  k4 

0  0  0 


(64) 


(65) 


Observe  that  G,  in  (63)  and  L,  in  (65)  are  always  block- 
diagonal  (hence  the  subscript  ’e’  is  justified).  This  is  clear  from 
as  follows;  (Au,Bu)  in  (60)  is  similar  to  (Ai„.„Bj„,,)  in  (54) 

A^..  =  QR-‘A,.Qa,  Bu.,.  =  QR"‘B:,  (66) 

Moreover,  since  (Ai^.B^,)  =  (Ai„Bj,)  except  that  a©,]  =  1, 
4o.o  =  0.  *ad  6o.i  =  0,  there  exist  ( A',^.„  Bi^,.)  which  is  similar 
to  (A'^,,Bi,  ,),  and  is  of  the  extended  form.  Specifically, 

AL..  =  QR-‘a;,QR,  Bi„,.  =  QR-‘B;.  (67) 

G,  and  L,  are  now  diagonal  respectively  from  (62), (66)  and 

(67). 

(vii)  Finally,  the  controlled  extended  invariant  system  is 
obtained  as  L(Au  ~  Bi«L,M„TnQR. B3,G„MnToQR,0), 
which  is  similar  to  i(A,  —  B,L,M„,B,G<,Mn,0).  This  is  the 
overall  extended  invariant  system  in  (50)  and  THEOREM  4, 
and  hence  the  input  transformation  G,  and  the  state  feedback 
L,  is  realixed  with  the  cycitvstatic  input  transformation  and 
state  feedback  LP^  =  LP(0,0,0,Gt)  tnd  LPt  =  (0,0,0, Lt}, 
respectively. 


V.  CONCLUDIN’G  REMARKS 

Another  reachability  canonical  form  can  be  obtained  us¬ 
ing  the  Scheme  I  instep  of  Scheme  11  in  the  ALGORITHM. 
Observability  canonical  forms  can  also  be  found  by  using  the 
same  steps  in  the  ALGORITHM  and  the  obvious  duality.  These 


canonical  forms  are  useful  mathematical  tools,  applicable  to  the 
modeling,  robust  control  and  multi-rate  systems. 

APPENDIX 


For  simple  notation,  let  A  =  Au>>  xii  =  1Bim|i 
1B,«1,.  From  (58), 

and  eii  = 

A*eii  =  oi.j A^Cjj  oi.iAeii  oa.iejx 

(68) 

Acu  =  oo.o^ii 

(69) 

Let 

A*cji  —  a^.iA'Cii  —  Oi.iCii  =  ejs 

(70) 

then,  from  (68)  and  (70), 

Acis  — 

(71) 

Let 

A’cii  —  oi.iAeii  =  ei4 

(72) 

then,  from  (70)  and  (72), 

Acn  =  Ox  jCii  cts 

(73) 

Let 

A'cxx  —  Oj.iCii  =  cij 

(74) 

then,  form  (72)  and  (74), 

Aeij  =  ei4 

(75) 

Let 

Aeii  =  cii 

(76) 

then,  from  (74)  and  (76), 

Aexs  =  Oj  iCii  -r  Cxs 

(77) 

Form  a  transformation  R  s  icn  txt  ttt  cu  Xtt  cstj*  and  we 
obtain  (59)  and  (60)  by  applying  the  formula  At^R  =  RAi^, 
B,«  =  RB^  with  (69),(71),(73),(75).(76)  and  (77). 
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Abatnct 

The  synthesis  of  minimum  sensitivity  state-space  realizatioas 
of  linear  time  invariant  systems  is  a  well  understood  problem. 
Such  realizations  have  been  linked  to  balanced  realizations.  In 
thin  paper,  the  theory  is  extended  to  the  synthesis  of  minimnm 
sensitivity  gtneraliitd  state-space  modeb  for  singular  linear  sys¬ 
tems.  A  scalar  sensitivity  measure  m  first  defined.  Then  the 
minimization  of  the  measure  b  consider  over  all  admissible  re- 
alizationa.  Since  minima]  realizations  are  not  required  to  be  re¬ 
lated  by  similarity  transformation,  the  optimization  problem  b 
noote  complex.  A  criterion  a  given  for  determining  optimal  sen¬ 
sitivity  stmctnres.  In  the  nonsingnlar  case,  the  criterion  lednces 
to  the  familiar  result.  The  simple  example  of  the  right-ehift  op¬ 
erator  b  considered. 


I  Introdnction 

The  sensitivity  of  state^pace  realizations  of  linear  time- 
invariant  systems  has  been  a  subject  of  oonsiderabb  interest 
to  leseardieTs  daring  the  past  five  yean  {8,9,10,14,15,16,17,18|. 
Mnch  of  the  motivalion  for  thb  researdi  has  been  the  desire 
to  design  digital  filters  and  analog  networks  with  mhiimnin  pa¬ 
rameter  sensitivity.  At  present,  the  sensitivity  theory  fm  thb 
class  of  systems  a  well  established.  The  purpose  of  thb  paper 
b  to  stndy  the  sensitivity  of  ftmcniued  state-space  raalizatioaa 
for  singnlar  linear  systems.  An  exampb  from  thb  class  b  the 
system  described  by  the  set  of  difference  equations  of  the  form 

=  Ax»  +  bth  (I) 

y*  =  (2) 

where  ^  b  a  singular  n  X  n  matrix  each  that  the  semi-state  vector 
z»  €  9"  a  defined  impUdtty.  We  restrict  the  dbeosMon  here  to 
the  dberete-time,  sin^variabb  case  for  brevity,  bat  much  of 
the  foUowing  development  can  be  extended  to  the  eontinnons- 
time  and  mnltivaria^  cases. 

In  the  sessf  linear  system  case,  where  £  =  /.  it  b  weO  known 
that  state-variabb  modeb  are  not  unique.  Any  two  minimal 
itats  spacs  realizatioas  of  a  given  transfer  function  are  related 
by  a  siniiiarity  transformation,  Le.,  by  a  change  in  basb  for  the 
rtitn  spirr  In  the  singniar  case,  however,  minimal  lealisatioru 
are  not  necesearily  related  by  similarity.  The  set  of  adminribb 
Craasformatioaa  m  mack  larger.  Thus,  determining  optimal  sen- 
eitivity  rsalbationa  for  the  tingnlar  case  b  a  non-triviai  extensioa 
of  existing  theory  for  the  usnal  linear  case. 

An  ontline  of  the  paper  b  as  foQows.  We  begin  by  defining 
•ennitivity  measures  in  a  manner  analogous  to  the  nsnal  Unear 


case  |14, 15, 16,17].  Ueiug  these  measures,  the  probbm  of  iden¬ 
tifying  optimal  generalized  state-space  structures  b  considered. 
The  theory  b  then  applied  to  the  exampb  of  the  right-shift  op¬ 
erator. 


Given  the  system  ( 1  )-(2),  the  corresponding  transfer  function 
b 

A(2)=c(2£-A)-'4.  (3) 

Any  matrix  pair  {U,V)  €  GLl{9)  ^  GL.(S)  x  GI>.{S)  will 
provide  another  order  realization  of  the  transfer  function  by 
applying  the  action 

;  IV, V)  x  IE,  A,b,c)^  {U-'EV, V-'AV,  tf-'fi, tV),  (4) 

where  E.  b  defined  to  be  the  set  of  all  order  realizations. 
In  general  b  b  not  a  siniiiarity  action  since  U  and  V  may  be 
distinct.  When  n  corresponds  to  the  mmimal  order  reqoired  to 
realise  A(2),  then  afl  miiinn*]  realizations  of  h(z)  Ue  on  an  orbit 
of  ^{71. 

Ihe  sensitivity  of  the  transfer  fnnetion  to  the  realization  pa¬ 
rameters  b  described  by  the  sensitivity  fonctioss; 

dhjz)  3h(2)  dkjz)  ffhfz) 

117’  '  dk  ’  dc  ■  '  ' 

Anafogons  to  the  resnlts  for  the  usnal  linear  system  case  reported 
in  |14,15.16,17],  we  have  the  following  lemma. 

Lemma  Z.1  Dtfiue  Uu  veetm 

f[z)  =  [zE~A)-'b  (6) 

,(2)  =  (7) 

then 

dh{z)  .  ,  dk(z)  _ 

^  ^  =  (9) 

Pnof.  The  expreseiona  in  (8)  are  obvious  generalizations  of 
those  in  [14,15.16,17].  The  expressiOBS  in  (9)  are  proven  using 
Krooeder  matrix  aigeixa  |3,4j  as  follows  : 

=  (/.  ®  c(2£-  A)-'*). 


rM-jc^r-T/BQ/rvYw  iTiysi  on^iqflqicEE 


where  U,„  =  IX.1  ELt  S  £1* ,  and  E,»  is  an  n  x  n  matrix 

with  unity  at  the  (i,it)  position  and  rerow  eisewbere.  Thus, 


dhlz) 

bA 


dhlz) 

IT 


(^) 

(/.  ®  b^{z£7  -  A^)-' 0  {zET  - 
(/.  0  6’‘(2£^  -  A^)-')[‘rEr  -  A’T'c’’  0  /.) 

{{zET  -  .4’‘)-‘c^)  0  (fc’’(z£^  -  /’•)-') 
(2£^-X’‘)-‘cV(2£^ ->!’■)“ 

=  -9lz]r(z)z, 


where  A,  /  and  g  are  defined  by  (3),  (6)  and  (7),  respectively, 
with  the  roles  of  A  and  E  interchanged.  ■ 


Since  ultimately  we  wish  to  optimize  a  scalar  measure  of  sen¬ 
sitivity,  define  the  following  integral  measure  for  matrix  valued 
functions  of  z 

\\G\\,=  (^-^f\\G{z)\y,z-'dz^\  (10) 

where  ||  •  ||f>  denotes  the  Frobenius  norm.  By  Parseval’s  theorem 
it  is  clear  that  the  norm  ||  - 1|,  is  a  matrix  generalization  of  the 
nsual  If  norm  for  sequence  spaces.  When  p  =  2  such  a  norm 
has  the  physical  interpretation  of  energy.  The  scalar  sensitivity 
measure.  A/,  is  defined  as 


This  b  a  generalization  of  the  sensitivity  measure  developed  in 
|14,lo, 16,17]  for  state-epace  realizations.  The  use  of  two  different 
matrix  norms  in  (11)  is  strictly  for  mathematical  convenience. 

Rather  than  minimizing  M  directly,  it  is  more  tractable  to 
minimize  a  particular  upper  bound  on  M.  We  refer  to  the  re¬ 
sulting  realizations  as  bound-optimd.  Observe  that 

=  l9(^)r{4r  =  ll/(")llr  •  (12) 


Thus,  it  follows  from  the  Cauchy-Scfawarz  inequality  that 


=  IM!’II/•/l!^  (H) 


where  ffr)  =  2.  Letting  {/,}  and  {}<)  denote  the  inverse  2- 
transform  of  f{z)  and  j(2),  respectively,  it  follows  from  Parse- 
vaJ's  theorem  that 


II 

^  1  \\9\z]\^  z~' dz 

(15) 

= 

y  iisu 

«c 

(16) 

11/ -All*  = 

(17) 

= 

^  /m.i/*+1 

(18) 

= 

00 

E  rji- 

(19) 

(Note  that  \\f  •  /jj,  =  ||/||,.)  Thus,  M  is  bounded  above  by 

Af *  =  2  ^  /T/*  ■  ^  sjsf^  ^  9iSt-  (20) 

A— «o  £s«od  JtS^CO 

M  =  Af*  if  and  only  if  there  exist  scalar  a  such  that  g^  —  a Jt 
for  every  integer  k. 

For  singular  systems  the  expressions  for  Jf  and  gi  can  not 
be.  given  expUdtly  in  terms  of  the  realization  parameters  (as  is 
the  case  for  usual  linear  systems).  However,  the  sequences  can 
be  represented  implicitly  in  terms  of  the  system’s  fundamental 
matrix  |11,12].  The  unique  Laurent  expansion  about  the  point 
at  infinity  of  the  resolvent  matrix  [zE  -  i4)”‘  is  given  by 

(2r->l)->  =  2-  £  (21) 

where  p  >  1  is  the  bdex  of  nilpotency  of  the  matrix  pencil 
(zE  —  i4).  The  fundamental  matrix  (pt)  satisfies  the  recursion 
relations 


E4>k-Ad>..x  =  SffI  (22) 

dkE  —  bk-iA  —  fio*/.  (23) 

(Aa  is  the  Kronecker  delta.)  b  terms  of  the  fundamental  matrix, 
{/t}  and  {^t}  ve  expressed  as 

ft  —  bfbu,^  (24) 

gt  =  ail  (25) 


where  Ui  is  the  unit  step  function.  Hence,  another  expression 
for  M’  is 

A/*  =  2  £  £  cpipfr 


+  y  (26) 

For  the  nsual  Ibear  system  case,  bt  =  for  k  >  0  (zero 
otherwise)  and  p  =  1.  In  this  case,  the  summatbn  bvohrmg 
b  m  (26)  can  easily  be  shown  to  be  equal  to  the  trace  of  the 
reachability  giammian.  Likewise,  the  summation  mvolvmg  c  is 
equal  to  the  trace  of  the  observability  gnmmian.  The  set  of 
bound-optimal  state-space  structures  are  those  that  minimize 

K  =  =  0  (27) 
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over  all  possible  realizatioDS.  The  bound-optimal  set  is  charac¬ 
terized  by  the  property  that  each  member  has  its  reachability 
graminian  equal  to  its  observability  gTammian.  Such  reaiizatioits 
are  said  to  be  essentially  balanced  |8,18|.  In  order  to  establish 
an  analogous  result  for  singular  systems,  we  wish  to  express  iW* 
in  terms  of  grammian  matrices.  There  are  at  least  two  ways 
to  define  the  reachability  and  observability  grammians  for  sin¬ 
gular  system.  Each  de^tion  ultimately  depends  on  how  the 
concepts  of  reachability  and  observability  are  extended  from  the 
nonsingular  to  the  singular  case.  This  is  still  an  active  area  of 
research  [1,2,6,7,11,13].  Consider  first  the  reachability  and  ob¬ 
servability  matrices  defined  by  Lewis  in  |11.13|.  The  subscript 
"S’  indicates  that  these  definitions  follow  from  the  notion  of 
reachability  and  observability  in  the  symmetric  sense. 


Definition  2.1  [11]  For  the  system  (l)-(2)  with  \  zE-A\^0, 
define  the  symmetric  reachability  and  observability  matrices  as 


^sAE,A,b]  = 


and 


[  Oijb  0j6  ... 

d-,-ib  ...  0^}b  6-ib 


COa  cd__, 
ct>i  ;■ 


OsAE,  /I,  c)  = 

Cd-5 

cdj  eo-i 

respectively,  }or  some  non-negative  integer  i. 


(28) 


(29) 


In  terms  of  the  matrices  in  the  definition  above,  the  natural 
definitions  for  reachability  and  observability  grammians  are  as 
follows. 


Definition  2.2  For  the  system  (l)-(2)  with  |  rJ  -  .4  jji  0, 
define  the  symmetric  reachability  and  observability  grammians 
as 

and 

respectively. 

It  is  easily  shown  using  (22)  and  (23)  that  the  steady-state 
grammians  (when  the  limits  exist) 


Psa  -  Zs.iZl^ 

(30) 

Qs.i  = 

(31) 

0 


Fs  ^  lmFs.,= 

oo  ^ 

satisfy  the  matrix  equations 

E.FsE:  =  AgPsAl^E^S^Ej; 


where 


EJ^QsEa  =  A^QsAe  +  E^c^cEa, 
Ac  = 


Ea  = 

= 

e^c  = 


E  0 
0  A 


A  0 
0  E 


<babb'^0l  0 
0 


(32) 

(33) 

(34) 

(35) 

(36) 

(37) 

(38) 


Another  nsefnl  reachability-observability  grammian  matrix 
pair  comes  from  the  following  definition. 


Definition  2.3  [1,13]  For  the  system  (ll-(2)  mth  \zE—  A\^ 
U,  define  the  forward  reachability  and  observability  matrices  as 


Z,lE.A.b}  =  U.,b  ...  d>-ib  pa’o  dih  ...  g>,b  I 

(3^) 

and 

■  cd-,. 


0,{E,A,c)  = 


CO-i 

cdo 

CO| 


(40) 


I  J 

respectively,  for  some  non-negative  integer  i. 


In  terms  of  the  forward  reachability  and  observability  matri¬ 
ces,  the  natural  definitions  of  the  reachability  and  observability 
grammians  are  given  below. 


Definition  2.4  For  the  system  (l)-(2)  with  |  zE  —  A  ]5^  0, 
define  the  forward  reachability  and  observability  grammians  as 

P.  =  Z.ZJ  (41) 

and 

Qi=croi,  (42) 

respecthely. 

It  is  easily  verified  that  the  steady-state  forward  grammians 
(when  the  limits  exist) 


P  —  lim  Pi  =  V  ptblFibl 

*=-» 

(43) 

Q  ^  lim  Q.  =  F  dieted* 

•  iCB 

(44) 

satisfy  the  equations 

EPE^  =  AFA^-t-{E6ablF<bIEP'-Ao.ibb-^<?Z,A^)  (45) 

EP'QE  =  A’’QA-(-(£TdJ‘c^cdo^-/4’'dI,c^cd_,/t).(46) 

For  the  usual  linear  case,  equations  (45)  and  (46)  reduce  to  the 
familiar  Lyapunov  equations.  (Note  that  Definition  2.4  is  closely 
related  to,  yet  distinct  from,  that  given  by  Bender  in  [1].  The 
main  difference  b  that  P  and  Q  as  defined  above  will  always  be 
non-negative  definite.)  The  main  result  of  this  section  is  now 
given  in  the  following  lemma. 


Lemma  2.2  For  the  system  in  (l}-(2),  the  sensitivity  measure 
M  siAisfies  the  inequality  M  <  M',  where 

M'  =  tTrPTrQA-TrP^TrQ  (47) 

=  rrrPsTrQs  +  Trp!!  +  TrQs.  (48) 

Proof.  Substitute  the  definitions  for  P  and  Q  into  (47).  Equa¬ 
tion  (48)  follows  directly  from  (47),  since  Tr  Pg  =  Tr  P  aad 
TrQs  =  TrQ.  . 
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m.  MTNTMT'M  cPNglTINTrY  REALIZATIONS 

We  first  consider  the  probiein  of  tninimuing  M'  over  «// 
minimal  realizations  of  a  fixed  transfer  function  /i|r).  b  view  of 
(4),  the  effect  of  a  transformation  pair  ([',  V')  on  the  grammians 
Ps  and  Qs  is  given  by  the  action 

Vs  : 

:  (U,V]x(Ps,Qs)^{V-^PsV--^,U^QsV],  (49) 

where  V  ~  I,®  U  and  V  =  I,®  V.  Similarly,  the  effect  of  a 
transform  pair  on  and  Q  is  given  by 


;  {V,  V)  X  (?.  Q)  -  {V-’  PV--^,  ITQU).  (50) 

Sbce  {U,V)  can  be  selected  arbitrarily  from  dJ(S),  then  for 
some  given  (FiQ)  apply  the  transformation 

u.  =  J.;  V.  =  (51) 

with  f  >  0.  Then, 

A/-  («)  =  2f  Tr  PTrQ+€^TrP  +  c^Tr  Q.  (52) 

Thus,  the  sensitivity  measure  A/  can  be  made  arbitrarily  small 
smce  0  <  Af  <  A/‘(e).  This  is  a  phenomena  that  does  not 
occur  m  the  usual  Ibear  system  case,  primarily  because  the  set 
of  similarity  transformations 

S^{(C;,K)€GIi(»)lC/  =  K}  (53) 


is  much  more  restrictb'e.  Observe  that  (U„  V,)  is  not  an  element 
of  5. 

From  a  practical  point  of  view,  the  transformation  4>(u..v.)  is 
not  very  useful  The  resultmg  realizations  for  small  values  of  e 
would  have  poor  quantization  properties  sbce  the  components 
of  E  and  A  could  be  made  arbitrarily  large  relative  to  the  com¬ 
ponents  of  b  and  c.  The  problem  hereb  is  to  sufficiently  restrict 
the  set  of  admissible  tra^ormations  such  that  the  problem  has 
a  meanbgful  answer.  There  are  many  possibilities.  We  shall 
consider  the  problem  of  minimizing  A/’  over  the  set  of  general¬ 
ized  state-space  transformations  havbg  the  same  fixed  E  matrix. 
For  example,  b  studybg  the  solvability  of  sbgular  systems  or 
m  realizations  methods  researchers  often  use  the  so-called  «emi- 
aplieit  form  of  (l)-(2),  where  E  is  fixed  to  be 


E  = 


I.  O' 

0  0  ’ 


(^) 


and  0  <  r  <  n  (see  (5,19]). 

Define  for  arbitrary  ^ed  £,  the  set  of  transformations  leav- 
bg  E  bvahant: 


Te  =  {(£,  V)  €  GL:  (S)  I  rv  =  UE}.  (55) 

When  E  =  I  then  clearly  ?£  =  5.  To  perform  the  minimization 
of  A/*  =  A/i  over  Te,  we  employ  a  generalization  of  the  Lagrange 
multiplier  technique  given  b  |8,18j.  The  method  allows  one  to 
adjob  a  matrix  constraint  to  a  scalar  valued  performance  index 
by  nsbg  the  foUowbg  lenuna. 


Lemma  3.1  For  arbitrary  matneu  X  and  Y, 

Tr  A{X  -  K)  =  0  (56) 

for  all  orthogonal  matneu  A,  if  and  only  if  X  =  Y . 


Theorem  3.1  Afj  is  minimized  over  Tc  for  arbitrary  fixed  E 
only  if 

EP  =  QE.  (57) 


Proof.  First  consider  minimizing  the  product  Tr  P  TrQ  over 
Tf.  Define  a  Hamiltonian 


H  =  rr(r-’ £!'-’■)  TriLTQU)  ATr  A(£f  '  -  UE],  (58) 

where  A  is  the  Lagrange  multiplier  and  A  is  any  orthogonal  ma¬ 
trix.  A  necessary  condition  for  an  extremal  is 

^  =  Tr{V-'PV-'^)2lFQ-XEA  =  0  (59) 

^  =  (-2V'-'/>V'-n'-‘)  Tr(Cr‘Qf/)  +  AA£  =  0.  (60) 

Eliminatbg  A  and  A  by  combining  (59)  and  (60)  gives 

Tr  [l^QU]  ■  £V'-'  =  Tr  (V  PV''^]  •  U^QUE.  (61) 

Thus,  the  optimal  transformations  {U,  V)  will  generate  a  matrix 
pair  (/’,Q)  such  that 

rr(Q)  •  EP  =  Tr(P)  •  QE.  (62) 

Now  observe  that  for  non-negative  definite  P  and  Q 

TrP  +  TrQ>2{TrP-TrQY,  (63) 

with  equality  if  and  only  UTrP  =  TrQ.  Hence,  minimizbg  the 
full  performance  bdex 


Mi:=TrPTrQ  +  TrP^TrQ  (64) 


requires  Tr{Q)  ■  EP  =  rr(P)  •  QE  and  TrP  =  TrQ., 

b  the  event  that  £  s  /,  it  follows  that  those  realizations 
satisfybg  (57)  are  the  essentiaily  balanced  realizations  of  |8,18j 
or  those  specified  b  |9,10,15,16.17].  If  £  has  the  form  given 
b  (54),  then  it  follows  easily  that  any  bound-optimal  realiza¬ 
tion  (£,  A,  b,  c)  has  a  correspondbg  matrix  pair  (£,  Q)  with  the 
structure 


(65) 


where  L  €  S'*'  and  Pd,  Qd  G  S'*“'>*<*"'>.  Choosbg  (U,  V)  € 
Ts  such  that 


T  0  1. 
0  cl.  I  • 


V’  = 


T  0  ] 
0 


gives  for  small  values  of  f  >  0  the  approximation 


(66) 


Mit<iTrT-'ET-^ ■TrT'ET+TrT-^IT-'^  +  TrrU.  (67) 


Lettbg  A<  denote  the  f"*  eigenvalue  of  £  and  using  the  methods 
b  |14,15,16,17|,  it  follows  directly  that 

Af;>  ^£A.j  +2f;A..  (68) 

Furthermore,  there  always  exist  T  €  GL,(Si  such  that  this  lower 
bound  is  achieved  arbitrarily  closely. 
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IV-  LXAMPLE 


Cuiusider  the  problem  of  determiouiR  the  minimum  sensitivity 
generalized  state-space  realization  of  the  right-shift  operator  in 
semi-explicit  form.  It  is  easily  verified,  for  example,  that  h(:j  = 
z  can  be  minimally  realized  by 


The  corresponding  fundamental  matrix  is  given  by 


The  problem  of  determining  nunimum  sensitivity  generalized 
state-space  realizations  was  considered  by  firsi  denning  a  sensi¬ 
tivity  measure  analogoos  to  thai  for  ihe  usual  linear  system 
case.  The  minimization  of  the  measure  was  then  shown  to  be 
a  meaningful  problem  only  if  the  set  of  admissible  realizations 
was  significantly  restricted.  An  interesting  example  of  such  a 
restriction  was  the  subset  of  transformations  which  keep  the  E 
matrix  invariant.  In  this  case,  the  optimality  criterion  devel¬ 
oped  was  an  extension  of  the  known  result  for  the  nonsingnlar 
case  and  led  to  the  notion  of  an  essentially  balanced  generalized 
state-space  realization.  The  theory  was  applied  to  the  minimnm 
sensitivity  synthesis  problem  for  the  right-shift  operator. 


Thus, 
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Abstract 

The  system  Hankel  matrix  plays  a  central  role  in  reaiizar 
tion  theory  for  linear  time-invariant  systems.  For  example,  each 
full  rank  factorization  of  the  Hankel  matrix  into  a  reachability- 
observability  matrix  pair  is  uniquely  related  to  a  minimal  state 
space  realization.  In  addition,  factorizations  derived  from  the 
singular  value  decomposition  of  the  Hankel  matrix  are  associ¬ 
ated  with  balanced  realizations,  which  are  known  to  have  optimal 
parametric  sensitivity  properties.  In  this  paper,  a  system  Hankel 
matrix  for  singular  linear  systems  is  defined  and,  in  an  analogous 
fashion,  used  to  develop  a  realization  synthesis  method,  define 
a  balanced  generalized  state  space  realization,  and  characterize 
those  realizations  which  have  minimum  sensitivity  properties. 

1.  Introduction 

Developing  a  complete  realization  theory  for  singular  linear 
systems  has  provided  some  challenging  problems  for  researchers. 
There  has  been  significant  success  in  the  development  of  algo¬ 
rithms  for  minimal  realization  synthesis  from  given  input-output 
behavior  [3,4,6,7,18|.  Theory  also  exists  for  characterizing  mini¬ 
mality  [8,17|,  reachability  and  observability  [1,12,13, 144fi|i  uni 
determining  canonical  forms  [S|.  But  a  strong  unifying  realiza¬ 
tion  theory  for  singular  systems  comparable  to  that  for  the  non- 
singular  case  has  yet  to  be  presented.  A  solid  realization  theory 
is  certainly  a  prerequisite  for  deriving  physical  interpretations  of 
realizations,  as  well  as  understanding  computational  structures. 

In  this  paper,  a  Hankel  matrix  approach  to  singular  linear 
system  realization  theory  is  presented,  which  is  analogous  to  the 
methods  of  Kung  for  the  nonsingular  case  [llj.  The  focus  is 
exclusively  on  discrete-time  or  descriptor  systems.  Motivated 
by  definitions  of  reachability  and  observability,  the  notion  of  a 
system  Hankel  matrix  is  first  defined.  The  system  Hankel  ma¬ 
trix  is  then  shown  to  have  reachability-observability  matrix  fac¬ 
torizations  which  can  be  used  to  solve  the  realization  synthesis 
problem.  Next,  the  notion  of  a  balanced  realization  for  singu¬ 
lar  systems  is  derived  using  the  singular  value  decomposition  of 
the  system  Hankel  matrix.  The  final  section  gives  an  applica¬ 
tion  of  this  theory  for  synthesizing  realizations  which  have  min¬ 
imum  parametric  sensitivity  properties.  It  is  demonstrated  that 
such  realizations  are  related  to  the  notion  of  balancing.  Analo¬ 
gous  connections  exist  for  the  nonsingular  case  [9,19|,  and  such 
a  cormection  has  been  demonstrated  in  the  singular  case  by  a 
completely  independent  method  il0|. 


n.  The  Singular  System  Hankel  Matrix 

A  singulsir  linear  system  of  difference  equations 

Exk+i  =  +  Buk  (1) 

ilk  —  Cxh,  (2) 

where  £  is  a  singular  n  x  n  matrix,  is  said  to  realize  a  given  p 
by  m  rational  transfer  function  matrix  H{z]  when 

3{z)  =  C{2E-A)-'B.  (3) 

In  the  usual  linear  system  case,  where  .E  =  /,  it  is  well  known 
that  state  variable  models  are  not  unique.  Any  two  tninimaJ  state 
apace  realizations  of  a  given  transfer  function  matrix  are  related 
by  a  similarity  transformation,  he.,  by  a  change  in  basis  for  the 
state  space.  In  the  singular  case,  however,  minimal  realizations 
are  not  necessarily  related  by  similarity.  The  set  of  admissible 
transformations  is  considerably  larger.  Any  matrix  pzur  (U,  V)  € 
GL1{9)  =  GL„(S)  X  GL„{lk)  will  provide  another  i»**  order 
realization  of  the  transfer  matrix  by  applying  the  action 

4  :  Gii:(3i)  X  E,(S)  « 

;  {U,V)  X  (E,A,B,C)  >-*  (l/-'EV,tr'AV,U-'B,CV), 

where  Zn(Sf)  is  defined  as  the  set  of  all  order  realizations. 
In  general  ^  is  not  a  similarity  action  since  U  and  V  may  be 
distinct. 

For  a  given  realization  {E,A,B,C),  the  corresponding  fun¬ 
damental  matrix  is  defined  in  terms  of  the  unique  Lau¬ 

rent  expansion  about  the  point  at  infinity  of  the  resolvent  matrix 
(zE-A)-‘ 

(sE-A)-*=s-‘  f;  (4) 

where  p  >  1  is  the  index  of  nilpotency  of  the  matrix  pencil 

(zE  —  A)  [l2,13j.  The  fundamental  matrix  is  known  to  satisfy 
the  recursion  relations 

E^t  —  Adk-i  =  SokI  (5) 

<^kE  —  ^k-iA  =  Sok^-  (6) 

ia  the  Kronecker  delta.)  Furthermore,  equation  (4)  implies 

that  the  corresponding  transfer  matrix,  H{z),  has  a  series  rep¬ 
resentation 

H{z)=  f:  H,z-.  (7) 

where 

H.  =  Cdi.iB.  (8) 
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Hence,  we  shall  refer  to  the  sequence  defined  by  (8)  as 

the  generalized  Markov  parameters  for  the  system  (l)-(2)  [IS]. 

The  notions  of  reachability  and  observability  for  singular  sys¬ 
tems  is  an  active  area  of  research  [1,8,12,13,14,16].  The  possible 
noncausality  of  (l)-(2)  makes  the  extension  of  these  concepts 
nontrivial.  For  the  purpose  of  factoring  the  system  Hankel  mar 
trix,  it  useful  to  define  the  notions  of  forward  reachability  and 
observability  as  given  below. 

Definition  2.1  [14]  A  regular  system  (l)-(S)  i*  »aid  to  be  for¬ 
ward  reachable  if,  for  every  z  €  91",  there  exists  an  integer 
j  >0  and  an  input  {uk}tao  ^  when  Zo  =  0. 

Definition  2.2  [14]  A  regular  system  (l)-(S)  is  said  to  be  for¬ 
ward  observable  if  there  exists  an  integer  i  >  0  such  that  if 
the  zero  input  response  {yk}to-k  **  precisely  zero  then  Exo  =  0. 

The  following  tests  can  be  used  for  determining  forward  reach¬ 
ability  and  observability. 


Proof.  Viewing  l/[>,j]  as  a  2  x  2  block  partitioned  matrix,  it 
is  clear  that  the  block  matrix  in  the  lower  right  position  is  the 
Hankel  matrix  for  the  strictly  proper  portion  of  H(s),  say  H„(z). 
It  is  well  known  that  ^,,(2)  has  a  finite  dimensional  state-space 
realization  if  and  only  if  its  corresponding  Hankel  matrix  )l,,\i,j\ 
has  the  property  that  there  exists  positive  integer  r  such  that 
p{H„\r  +  »,r  -I-  j])  =  r  for  all  i,j  =  0,1,2,....  In  view  of  the 
Weierstrass  form  [13],  if  yt  is  finite  then  the  necessity  condition 
follows  directly.  The  sufficiency  condition  follows  from  the  fact 
that  when  both  nontrivial  submatrices  of  V  [-f-oo,  +00]  have  finite 
rank  then  a  finite  dimensional  generalized  state  space  realization 
can  be  constructed  by  known  algorithms  (see  for  example  [6]).  ■ 


Lemma  2.3  Every  realization  (E,A,B,C)  of  a  given  trarufer 
matrix  is  related  to  the  system  Hankel  matrix  via  the  equality 

J/[.-,j]  =  0,EZ,.  (9) 


Lemma  2.1  [14]  A  regular  system  (l)-(B)  is  forward  reachable 
if  and  only  if  the  forward  reachability  matrix 

zae,a,b)=  -  •  ;  ^ 

[  ...  ^oB  ^iB  ...  ^,B  J 

has  rank  n  for  j  =  deg{\  zE  —  A  ]). 


Lemma  2.2  [14]  A  regular  system  (!)-(£)  is  forward  observable 
if  and  only  if  the  forward  observability  matrix 


C<^., 


0,(B,yl,C)  = 


C4>.x 

C<l>o 

C<bx 


L  \ 

has  at  least  the  rank  of  E  when  i  =  deg{\  zE  —  A  ]). 


Proof.  The  result  follows  directly  from  the  definitions  using  the 
property 

i  :  t  <  0,  j  <  0 


•bxEtbi  = 


4>i*i  :  »■  >  0,  y  >  0  , 


0  :  otherwise 


which  is  proven  in  [14].  a 


This  factorization  is  the  natural  extension  of  the  result  known 
for  the  usual  linear  system  case  (B  =  J).  A  surprising  property, 
however,  of  the  generalised  Hankel  matrix  is  that  it  only  specifies 
the  system  uniquely  modulo  a  feedforward  component.  That  is, 
the  parameter  Bo  does  not  appear  in  l/[*,y].  The  consequences 
of  this  fact  will  be  discussed  in  the  next  section.  Also,  note  in 
reference  to  Theorem  2.1  that 


p(E)  >  p(lf  [-1-00,  -1-00])  =  u  (10) 


for  any  realization  (B,A,B,C)  of  B(z).  Furthermore,  it  follows 
directly  that 


Consider  the  following  definition. 


Definition  2.3  The  system  (block)  Hankel  matrix  for  a  given 
ratiofud  transfer  matrix  H{x)  is  defined  as 

■  0 

0 

0  ...  0 

0  -B-^. 

0 

^/[«-.yi  = 

0  - 

B.,^,  ...  -B., 

Hx 

0  Bj 

Bj  ...  By+I 

Bj 

Bn-x 

B,+3  . . .  . 

[p  -  l)p(B_„+,)  +  r  <  J/  <  (/i  -  1)  mm(p,m)  +  r.  (11) 

r 

When  (B,  A,  B,C)  is  a  minimal  realization  with  n  =  n,M>  then 

r  =  deg{\  zE  -  A  ]).  (12) 

From  the  characterization  of  n,,,,  in  [6]  a  simple  calculation  gives 

pp[H.„^.i)  +  r  <  <  p  mm(p, m)  +  r.  (13) 

By  assumption  is  not  identically  zero.  So  in  the  case 

where  min(p,m)  =  1,  the  inequalities  (11)  and  (13)  combine  to 
give 

f  +  1.  (14) 


Theorem  2.1  The  system  Hankel  matrix  for  a  given  rational 

transfer  matrix  H(z)  has  a  finite  dimensional  generalized  state-  HI.  Hankel  \latrix  Realization  Theory 

space  realization  if  and  only  if  there  exists  non-negative  integers  r 
and  V  such  that  the  rank p{)l\r+i,r+j])  =  1/  for  all i  =  0,1,2,.. . 
and  j  =  0,1,2,.... 
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In  this  section,  a  realization  theory  and  algorithm  is  pre¬ 
sented  which  is  analogous  to  that  given  by  Kung  in  [ll]  for  clas¬ 
sical  linear  systems.  That  is,  we  wish  to  consider  the  problem 


of  extracting  generalized  state  space  realizations  from  appropri¬ 
ate  factorizations  of  a  given  system  Hankel  matrix  as  suggested 
by  Lemma  2.3.  The  general  algorithm  presented  herein  is  not 
viewed  as  being  particularly  efficient  or  stable,  but  rather  as  a 
theoretical  tool  to  exhibit  some  of  the  structure  of  the  realizar 
tion  theory.  The  extended  theory  is  considerably  more  complex 
due  to  the  singular  nature  of  E.  For  example,  unlike  the  non¬ 
singular  case,  not  every  factorization  maps  to  a  corresponding 
realization.  Consider  the  following  definition. 

Definition  3.1  A  factorization  l/[t,j]  =  QiEZ^  of  a  given  rank 
V  syatem  Hankel  matrix,  where  E  €  Oi”’*",  u  taid  to  be  consis¬ 
tent  if 

I*  n  ^ 

iV.  ronk(fi'(»,j])  =  Tank{E), 

Hi.  0,EZ-  -  r„  =  0,^EZ^  - 

where  Fiy  ts  defined  a» 


with  the  block  column  being  nonzero,  and  Lij  similarly  defined 
with  the  block  row  being  noruero.  The  notations  [•|~  and  [-P 
represent  the  block  column  left  shift  and  the  block  row  up  shift 
operators,  respectively. 

A  consistent  factorization  {Oi,E,  Z^)  has  the  property  that  at 
least  one  realization  can  be  synthesized  from  it.  Observe,  from 


equations  (S]-(6)  it  follows  directly  that 

EZ~  =  Aje, +  10...0B0...0|  (16) 

0/E  =  aA  +  [0... 0(7^0. .’.O]*-.  (17) 

Premultiplying  equation  (16)  by  and  postmultiplying  (17)  by 
Zj  gives 

CiEz;-  =  OiAZ,  +  r„  (is) 

OJEZ,  =  aAZ,-l-L„:  (19) 

Thus,  the  remaining  realization  matrices  are  given  by 

C  =  the  block  row  of  Li^Z]  (20) 

B  =  the  Uoek  column  of  O/F,,  (21) 

A  =  0/(0/£-[O...OC'’0...0n  (22) 

=  (rje- -[o...oflo...o))je;,  (23) 


for'  sufficiently  large  i  and  j  and  where  [•]'  denotes  a  pauedo- 
inverse.  These  realization  matrices  are  uniquely  specified  if  p(0i) 
=  p(Zy)  =  n.  That  is,  when  (E,  A,  B,  C)  is  both  forward  reach¬ 
able  and  forward  observable. 

It  is  of  interest  to  note  that  since  the  system  Hankel  matrix 
is  not  a  function  of  Ha,  the  direct  feed  term,  one  has  the  option 
of  setting  Hig  =  0  in  the  assignment  of  Fi,-  and  above  and 
then  compensating  by  adding  the  known  direct  feed  term  to  the 
output  equation,  i.e., 

y»  =  Cx»  +  HaUa.  (24) 


In  fact,  this  interplay  between  the  direct  feed  term  in  the  semi¬ 
state  equation  and  that  in  the  output  equation  is  arbitrary.  If 
one  sets  the  direct  feed  parameter  in  F^y  and  to  any  compat¬ 
ible  matrix  A  and  then  compensates  by  adding  the  direct  feed 
term  {Ha  —  A)  Ui,  to  the  output  equation,  the  transfer  matrix  is 
invariant. 

IV.  Balanced  Generalized  Realizations 

It  is  well  known  in  linear  system  theory  that  a  balamced  re¬ 
alization  of  a  given  transfer  matrix  can  be  extracted  fiom  the 
singular  value  decomposition  (SVD)  of  its  system  Hankel  matrix 
[llj.  More  specifically,  the  SVD  can  be  used  as  a  tool  for  com¬ 
puting  a  special  set  of  factorizations  of  the  Hankel  matrix  which 
has  the  property  that  all  corresponding  realizations  yield  equal 
and  diagonal  reachability  and  observability  grammians.  In  this 
section,  the  extension  of  this  idea  is  considered  for  the  singular 
system  case.  It  should  be  mentioned  that  the  notion  of  balanc¬ 
ing  for  singular  systems  has  been  defined  in  [lOj,  but  in  a  quite 
different  context.  The  following  approach  is  consistent  with  this 
earlier  definition,  but  is  considerably  more  direct. 

In  terms  of  the  forward  reachability  and  observability  matri¬ 
ces,  the  natural  definitions  of  the  reachability  wd  observability 
grammians  are  given  below. 

Definition  4.1  For  a  regular  system  (l)-(S),  define  the  for¬ 
ward  reachability  and  observability  grammians  as 


Pi  =  ZiZf 

(25) 

and 

’  . 

Qi  =  OTOi, 

(26) 

respectively. 

It  is  easily  verified  that  the  steady-state  forward  grammians 
(when  the  limits  exist) 


P  ^  limP,=  £  (27) 

•  ' O*  I 

Q  =  llimgi=  £  (28) 

satisfy  the  equations 

EPE^  =  APA^ +  {E<baBB'(i>lE^ -A(b.yBB'<bliA^) 
H'QE  =  A^QA  +  {£f*'4'‘SC'C<^-A^4'ZiC^C<b.iA). 

For  the  usual  linear  system  case,  these  equations  reduce  to  the 
familiar  Lyapunov  equations. 

Theorem  4.1  The  infinite  system  Hankel  matrix  if [-l-oo,  +oo| 
with  rank  u  corresponding  to  a  given  rational  transfer  motnz 
H(s)  has  a  consistent  factorization  of  the  form 


0  =  UE'J\  Z  = 

(29) 

^  [  0  0  J  ’ 

(30) 

where  and  Ej/^  are  full  rank  diagonal  matrices  and  U^U  = 
K’-V  = 
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Proof.  The  aingular  v&lue  decomposition  of  ^|+ao,+oo]  is 
;/[+oo,+oo]  =  tfEV’’ =  l/ny’EEi'V’’  (31) 


=  U 


t'r 

0 


[t  SjlEr  oiv',(«) 


where  £o’  =  •  •  • ,  V^>  ■  •  ■  •  = 

d\ag(y/al, ...,  y/aZ,  ...,  >//?»-..)  with  the  a<’8  and  0j’s  as 

nonzero  free  parameters.  (0  denotes  a  compatibly  sized  zero 
matrix.)  This  decomposition  leads  directly  to  the  factorization 
in  equations  (29)- (30).  This  factorization  is  consistent  since  con¬ 
ditions  (i)  and  (ii)  are  satisfied  by  design,  and  condition  (iii)  can 
be  shown  to  be  satisfied  in  the  limit  by  direct  substitution  of 
equations  (29)-(30)  ■ 


CoroUary  4.1  Any  factorization  of  the  form  given  in  Theorem 
4-1  has  the  property  that  all  corresponding  realizations  have  for~ 
ward  reachability  and  observability  grammians  such  that 

EP  =  QE  =  ETz  =  T.oE.  (33) 

Definition  4.2  Any  realization  (E,A,B,C)  satisfying  equation 
(SS)  is  said  to  be  a  balanced  realization. 


V.  Mmimiun  Sensitivity  Realizations 

In  this  section,  we  consider  the  problem  of  finding  minimum 
sensitivity  generalized  state  space  realizations  of  a  given  ratio¬ 
nal  (possibly  noncausal)  transfer  matrix  H(z).  This  problem  has 
been  studied  in  a  purely  algebraic  context  [10],  but  the  following 
approach  is  geometric  in  nature  and  provides  a  natural  applicai- 
tion  of  the  realization  theory  described  above.  First,  an  abstract 
geometric  approach  is  briefly  described  for  solving  generic  min¬ 
imum  sensitivity  synthesis  problems  [9,19].  Then  the  approach 
is  applied  for  the  singular  linear  system  case. 

A  generic  realization  space,  say  6,  is  defined  tr  oe  an  affine 
space  of  admissible  realizations  (usually  some  subset  of  K')  with 
the  structure  of  a  smooth  fUemannian  manifold.  Each  point  9  € 
0  corresponds  to  an  admissible  realization.  In  every  modelling 
problem,  there  are  invariants  which  are  related  to  the  observed 
behavior  one  is  trying  to  model,  e.g.  Markov  parameters  in 
the  linear  systems  case  or  the  Volterra  kernels  for  more  general 
Volterra  type  systems.  Using  these  invariants,  called  observables, 
it  is  possible  to  partition  a  realization  space  into  equivalence 
classes.  Two  realizations  are  in  the  same  equivalence  class  if 
their  observables  assume  the  same  values. 

Assume  that  /  ;  0  •-»  S  is  a  smooth  function  that  maps  each 
realization  to  a  corresponding  scalar  observable.  Furthermom, 
assume  that  /  has  no  critical  points  in  0.  Then  if  follows  that 
each  equivalence  class  Mk{f)  =  /”‘(fc)i  k  €  /(0),  is  a  subman¬ 
ifold  of  0  with  dimension  s  —  1.  The  observable  thus  induces 
a  decomposition  of  the  realization  space  into  a  set  of  connected 
submanifolds  each  with  dimension  s  —  1.  Such  a  realization  space 
is  said  to  be  foliated,  and  the  points  in  each  equivalence  class 
make  up  a  leaf  of  the  foliation. 

The  parametric  sensitivity  problem  is  posed  as  follows:  at 
which  points  (realizations)  on  a  given  leaf  is  the  effect  on  the 
observables  of  a  parametric  perturbation  extremal  ?  On  each 
leaf  the  gradient  of  /  defines  a  smooth  normal  vector  field.  At 
each  point  on  a  given  leaf,  the  metric  tensor  g  on  &  induces 
the  norm  1|  ■  ||:  r,0  R  :  v  i-*  yjg(0)(v,v)  on  the  tangent  space 

r,0. 


Definition  S.l  A  realization  9"  6  Mk{f)  i*  on  extremal  sen¬ 
sitivity  point  of  if  and  only  if  9“  is  an  extremal  of  the 

performance  index  L{9)  =  A  ||  V/  |P  over  the  manifold  Mk[f). 

The  following  Theorem  (see  |9,19|)  gives  a  necessary  condition 
satisfied  by  all  extremal  points  in  the  event  that  the  metric  ten¬ 
sor  g  on  0  is  taken  to  be  uniform  (a  typical  assumption  in  the 
analysis  of  fixed  point  arithmetic). 

Theorem  5.1  If  a  realization  9"  &  Q  is  an  extremal  seruitivity 
point  then 

(VV(r)-A/)V/(fl-)=0,  (34) 

where  V f(9')  and  V®/(tf*)  are  the  gradient  vector  and  the  Bes~ 
Stan  operator,  respectively,  at  9",  I  is  the  identity  operator  on 
the  tangent  space  Tf&  and  A  €  S. 

In  other  words,  the  gradient  vector  of  /  at  ^  is  an  eigenvec¬ 
tor  of  the  Hessian  matrix  at  9‘.  The  stated  condition  is  the 
Euier-Lagrange  equation  for  the  constrained  optimization  prob¬ 
lem.  The  type  of  extremum  u  easily  determined  by  the  definite¬ 
ness  of  the  second  variation. 

To  cast  the  mininnim  sensitivity  synthesis  problem  for  singu¬ 
lar  systems  in  the  geometric  context  described  above,  we  must 
first  identify  the  relevant  realization  spaces.  There  are  in  fact  two 
genera]  realization  spaces  we  shall  consider:  the  space  of  all 
order  generalized  state  space  realizations  [E,A,B,C),  C»(S), 
and  a  space  related  to  all  consistent  factorizations  of  all  possible 
system  Hankel  matrices  with  rank  <  v,  n„.  £.  is  clearly  iso¬ 
morphic  to  a  closed  subset  of  »»•’+»«+"*,  Xo  define  the  second 
realization  space,  consider  the  mapping 

w  :  £a(R)  >-»  X  Si"*'*^'''**"*"*’'** 

:  lE,A,B,C)>-r{OdE,A,C)E,EJiAE,A,B)),  (35) 

where  i,j  >  r~l  are  assumed  to  be  fixed  a  priori.  For  fixed  E, 
the  marginal  map  Us  defines  the  following  subsets  of 

n.(»)  =  ws  (£,(»)) 
n:'(»)  =  w.(Ei'(»)) 

ni'(»)  = 

ni'-^*(»)  =  w,(j:i'J*(»)). 

where 

Si'lR)  =  {s  6  S.(5l)  j  \zE  -  A|  /  0,  p(je,.,(£,A,B))  =  n} 
£;;'(»)  =  {s  €  E.(»)  1  \zE  -  A|  ^  0,  P(0.-.(E,  A,C))  =  n} 
£/,./.(jj)  ^  j,;;'(Ji)  n  £'•’(»). 

(/r  and  fo  refer  to  forward  reachable  and  forward  observable, 
respectively.  Note  that  Ei*(a)  is  only  a  subset  of  all  possible 
forward  observable  realizations.) 

In  general,  a  group  action  on  a  manifold  is  said  to  be  a  foliated 
action  if  the  orbits  form  leaves  of  a  foliation.  A  foliated  action  is 
characterized  by  the  property  that  the  dimension  of  the  isotropy 
subgroup  at  any  point  on  the  manifold  is  fixed.  The  actions 

^  :  C7Z;»(R)  X  £.(») 

:  (U,V)  X  {E,A,b,c)  ^  [V-^EV,U-^AV,V-^B,CV) 

and 

i>  :  CL»(»)  X  n„{R) -- a(») 

:  (V,V)x{0E,EIi)>-*{0EV,V-'E]l.) 
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on  E,(S)  and  n„(!R),  respectively,  are  not  foliated  actions 
unless  restricted  to  forward  reachable  and  observable  subsets 
respectively  [9). 

An  orbit  in  either  realisation  space  £''*(91)  or  n''*(9i)  under 
its  group  action  is  characterized  not  by  one  observable  but  by 
several,  naunely  the  entries  from  the  corresponding  Hankel  msr 
trix.  Hence,  we  must  slightly  generalize  the  geometric  method 
described  above.  Define  the  following  famUita  of  scalar>valued 
observable  functions 

h{E,A,B,C)  =  TtA{)(-0{E,A,C)EZI,E,A,B)){Zfi) 
h[OE,EJL)  =  TtlL()l-OEZ),  (37) 

where  A  is  any  compatible  matrix  which  has  all  of  its  singular 
values  precisely  equal  to  unity.  Note  that  if  =  0  for  all  such 
i  then  V  =  OEZ  (proof  in  [9,19]).  Thus,  it  follows  that  for 
a  v'^'en  singular  linear  system  characterized  by  a  rank  v  Hankel 
matrix  1/,  the  corresponding  orbit  in  either  E'''^*(!R)  or  ni'-^*(!ll) 
is  uniquely  characterized  by  /*  =  0  or  /*  =  0,  respectively,  for 
all  admissible  A.  If  we  express  in  component  form 

A  =  L  -  0ER),„  (38) 

it  is  apparent  that  this  family  of  observables  is  defined  by  the 
set  of  constraints  on  the  components  of  the  OE  and  EJL  msr 
trices  with  each  component  of  A  playing  the  role  of  a  Lagrange 
nniltiplier. 

To  characterize  extremal  sensitivity  realizations  of  singular 
linear  systems,  we  should  apply  Theorem  5.1  to  the  observable 
Jt,.  This  turns  out  to  be  a  formidable  problem.  So  instead  we 
shall  work  in  the  realization  (factorization)  space  n^''^*(Sl)  with 
the  goal  of  relating  the  solutions  of  the  two  problems  by  other 
means.  Applying  Theorem  5.1  to  the  observable  A  is  a  relatively 
simple  problem  because  it  is  a  linear  function  of  the  components 
of  [0E,ER).  Consider  the  following  Theorem. 


It  is  interesting  to  note  that  this  computation  is  possible  even 
though  0  and  Z  can  not  be  determined  unifue/y  from  OE  and 
EZ  due  to  the  singularity  of  E. 

The  computation  of  the  Hessian  matrix  is  not  so  obvious  since 
the  gradient  vector  is  not  an  explicit  function  of  $.  Consider, 
however,  that  by  the  product  rule  (see  |2j,  T4.3) 


3vec(0<£) 

dvec(Di)r’ 

avec((£Je,)n 

dvoc(;Cf)’’ 


'  -S'*'. 

*  ^ec(j?^)^  ®  =  E  9  li- 


Thus,  it  follows  that 


®  ^•^dvec(  (?,£)’• 


(E9li) 


dyoc(Zf) 

dvec([EZ,Yy 


For  brevity,  let 


/« 


(45) 

(46) 


avec(a)  avec(^f) 

°  dyec(0,E)r'  ~  dyocaEZ,)^y 

Then,  by  the  chain  rule  (see  {2},  T4.6),  it  follows  that  the  Hessian 
matrix  is 


^Va(tf)  = 


0  -(/<  9  A)Do 

-(L  ®  AfDj,  0 


The  optimality  condition,  then,  is  as  follows; 


(48) 


(^A(tf)-A/^)VA(tf)=0,  (49) 

r  -AAi  -(/<®A)Dol  [  -(4®  A)vec(0<)  1 
[  -(/*  ®  A)^Dx  i  ■  t  -(L  ®  Afvee(ZT)  J  “ 

(50) 

Equation  (50)  gives  directly  that 


Theorem  5.2  Given  a  tingvlar  linear  system  characterised  by  a 
syuare  Hankel  matrix  if[t,t],  then  extremal  sensitivity  points  on 
the  eorresportding  lea/  of  the  foliation  induced  by  the  observable 
family 

A(a£,  EZ,)  =  Tr  A()i[i, i]  -  0,EZ,)  (39) 

have  the  property  that 


Avec(Oi) +  Do(/<  ®  A)’‘vec(jef)  =  0  (51) 

Z7a(/<  ®  A)vec((?i)  +  Avec(X,*’)  =  0.  (52) 

Premnltiplication  of  equations  (51)  and  (52)  by  {E’’  9  4)  and 
(E  9  li),  respectively,  and  appiic.'.tion  of  the  properties  given  in 
(45)  and  (46),  gives 


EZ,Zf  =  0^0, E. 

Proof.  Use  the  optimality  equation  (34).  In  this  case 


.\yoc[ZrE^) 

[  yec(OiE) 


(40) 


(41) 


The  observable  function  can  be  expressed  in  terms  of  a  quadratic 
form  in  9  via  Kronecker  product  algebra  [2|  as  either 

A(8)  =  TrA{)l{i,i]  -  (vec(12f £’•))’•  •  (4  ®  A)  •vec(a)  (42) 


or  as 


/*(«)  ='n-A(;/|i,il-(vec(jen)’’{A®A)-vec(aE).  (43) 

With  these  representations,  the  gradient  vector  is  computed  as 


-(A  ®  A)vec(0,) 

-  -(4  ®  A)’-vec(jen 


(44) 


A(E*’®4)voc(ai)  +  (4®  A)’‘vec(jen  =  0  (53) 

(4  ®  A)vec(C(,)  +  A(E  ®  4)vec(jef)  *  0.  (54) 

It  follows  then  that 

XOiE  +  A^Zf  =  0  (55) 

AOi+AierE*’  =  0.  (56) 

Hence,  the  conclusion  follows  immediately  using  the  fact  that  A 
is  an  orthogonal  matrix  and  A  /  0.  ■ 

Thus,  we  concluded  that  minimum  sensitivity  factorisations 
with  E  fixed  in  the  form  given  in  (30)  have  corresponding  realizar 
tions  that  are  neariy  balanced  (letting  t  -»  oo)  in  the  sense  that 
they  are  only  an  orthogonal  transformation  (roUtion)  awqr  from 
being  balanced.  This  conclusion  is  analogous  to  that  reached  via 
the  earlier  algebraic  approach  in  [lOj  which  worked  directly  in 
the  first  realization  space,  rather  than  in  this  intermediate  fac¬ 
torisation  space.  When  E  =  I,  the  optimality  condition  reduces 
to  the  usual  result  for  linear  systems. 
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VI.  Conclusions 

In  this  paper,  a  definition  of  a  system  Hankel  matrix  is  given 
for  singular  linear  systems  and  used  to  develop  a  realization  the¬ 
ory  based  on  a  factorization  approach.  The  singular  value  de¬ 
composition  was  then  applied  to  the  system  Hankel  matrix  to 
define  a  balanced  generalized  state-space  realization.  Such  real¬ 
izations  were  then  used  to  characterize  those  realizations  which 
have  minimum  parametric  sensitivity  properties. 
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Abstract 


The  notion  of  system  description  or  representation 
Is  reconciled  vlth  system  realization  or 
Implementation.  By  the  latter  a  causal  construct  for 
obtaining  the  solution  will  be  understood.  The 
realization  starts  from  a  particular  description,  where 
dynamical  and  algebraic  equations  are  separated,  and 
the  dynamical  states  correspond  to  Integrator  outputs. 
Clearly  such  a  description  la  nonunlque.  The  optimal 
realization  problem  Is  then  to  find  the  particular 
realization  In  the  orbit  of  a  particular  group  H,  which 
minimizes  the  sensitivity  measure  In  [7].  This  group  H 
Is  a  subgroup,  leaving  the  (separation)  atructxtre 
Invariant,  of  the  group  characterizing  the  orbits  under 
restricted  system  equivalence.  A  practical 
Implamentaclon  Is  one  for  which  asymptotically  (In  a 
well  defined  sense)  the  solution  Is  obtained  with  a 
causal  regular  realization.  The  asymptotics  are 
obtained  by  considering  sequences  of  systems,  or  as  we 
propose,  via  the  use  of  nonstandard  analysis  methods. 
The  behavior  of  the  regularizing  parameter  Is 
determined  by  the  structure  at  Infinity  of  the  original 
singular  system.  As  exsmples,  a  differentiator  and  a 
purely  algebraic  sat  of  equations  are  discussed. 


Introduction 


In  previous  work  on  the  sensitivity  minimization 
of  singular  systems  [7],  the  problem  of  deriving  the 
realization  of  the  singular  system  was  attacked.  This 
problem  la  wall  understood  for  regular  systems.  Its 
practical  significance  la  that  In  the  presence  of 
parematar  dlsturbaneaa  the  response  of  the  (perturbed) 
optimal  realization  Is  close  to  the  nominal  or  desired 
response.  Now  In  following  a  similar  prograMs  for 
singular  systems  one  la  faced  with  an  additional 
problem  of  Interpretation.  Uhat  Is  an  laplamentatlon 
of  a  singular  system?  Realization  In  the  pure  sense 
always  means  an  Implementation  with  Integrators.  As  Is 
wmll  known  singular  systems  may  ozhlblt  net 
differentiation.  How  la  one  to  Implement  this?  In 
discrete  time,  the  situation  seams  even  worse,  since 
noncausal  behaviour  (l.a.  advances)  nay  result.  On  the 
other  hand,  any  simulation  or  computation  Is  inherently 
causal . 

In  this  paper  we  shell  cry  Co  reconcile  Che  notion 
of  system  raprssenceclon  or  description,  and  Chat  of 
system  raellzatlon  or  Implsmencaclon.  The  latter  will 
always  mean  a  causal  construction,  for  Instance  using 
a  universal  Turing  machine.  Only  when  this  problem  la 
satisfactorily  answered.  and  practical  ways  of 
computing  solutions  to  so-called  singular  systems  have 
been  found.  will  ic  sMka  sense  to  optimize  the 
computation,  and  speak  of  minimum  sensitivity 
Implaamtaciona . 


The  purpose  of  this  presentation  is  to  provide 
some  ideas  cowards  the  solution  of  the  above  sketched 
problem,  with  practical  implementations  in  mind.  Some 
new  tools  will  be  given,  in  particular  the  nonstwdard 
analysis.  Uhila  still  a  young  (a  little  more  than 
twenty  years)  branch  of  mathematics.  Its  presence  has 
already  been  felt  in  the  theory  of  differential 
equations .  And  tihile  its  name  nay  insinuate 
abhorrltions,  ic  is  not  true  that  ic  lies  outside  the 
’classical*  doiaain  of  mathematics,  nor  Is  It  In 
conflict  with  It.  As  expressed  by  Diener  and  Reeb  [5] 
in  chair  introduction,  the  nonstandard  analysis  adds 
-.aw  possibilities  to  one's  toolbox  by  giving  the 
existing  cools  more  power.  This  paper  will  then  also 
only  be  a  rather  modest  exploration  of  a  potential  use 
of  nonstandard  analysis  in  a  branch  of  system  theory. 
Perhaps  more  inspired  researchers  will  smooch  out  the 
comers . 

In  this  spirit,  the  differentiator  la  explored 
first  in  the  next  two  sections,  the  last  of  which 
considers  the  Inf lute  frequency  behavior  more  closely. 
The  following  section  presents  some  general  ideas 
distilled  from  this  ease  study.  In  ttam  this  Is 
followed  by  a  system  consisting  of  puce  algebraic 
equations.  An  effort  Is  made  to  compute  (In  a  causal 
way),  solutions,  or  approximations  of  solutions  Co 
singular  systems.  Some  reflections  are  collected  In 
Che  conclusions ,  and  an  appendix  gives  a  short 
"tutorial*  on  nonstandard  analysis. 


Case  Study  of  a  Singular  System:  The  Differentiator 


The  differentiator  has  a  singular  representation 


r  0 

'  *1  ■ 

m 

■  1  0  ] 

r  *ii 

■  0  ■ 

*w^ 

o 

o 

.  ^ . 

.  0  1  J 

.  *2 

.  1  . 

y  -  [-1  01  txi.xjl' 


Symbolically,  we  shall  reprasanc  the  system  In  an 
’open*  form,  as  Indicated  In  figure  1.  Here  two  new 
symbols  are  Introduced,  a  destructor  or  black  hole 
(or  sink).  Indicated  by  the  big  dark  arrow,  and  a 
creator  or  white  hole  (a  source).  Indicated  by  Che  big 
white  arrow.  The  Idea  la  to  lac  these  symbols 
ropreaanc  the  algebraic  constraints  in  the  above 
'system' .  Thus  the  black  hole  MUST  have  a  zero  signal 
going  INTO  It,  whereas  at  the  ’white  hole*,  a  signal  Is 
CREATED,  hare  x^,  which  forces  the  black  hole  Input  to 
be  zero. 
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Figura  1.  Tha  *opan  fora*  rapraaantation. 


So  far  wa  hava  only  a  naw  plctura,  buc  nothing 
aaaanclaXly  naw  yac.  Soaa thing  naw  la  obtainad,  at 
laaat  haurlatlcally.  If  wa  think  of  tha  black  and  whlta 
holaa  raapaetlvaly  aa  Input  and  output  of  aona  faat 
ayataa,  aa  Indleatad  In  flgura  2,  alttlng  'bahlnd  tha 
aeana*,  and  than  ralax  tha  eonatraint  that  tha  black 
hola  Input  Buat  ba  zaro. 


Flgura  2.  Tha  *faat*  ayataa. 


Glwan  a  aaall  davlatloe  at  tha  Input  of  tha  *faat* 
ayataa.  It  chan  ^Ickly  changaa  tha  algnal  at  tha  whlta 
hola  pert,  lea  output.  Than  ena  can  hopa  chat  with 
Chla  faadl^k  atruetura,  tha  black  hola  Input  will  ba 
driven  CO  zaro.  Thla  aounda  plaualbla  la  worda,  buc 
will  It  work?  Tha  problaa  la  how  to  charactarlza 
*faaC*.  Moraovar  thlnga  wara  dlacuaaad  In  aone  kind  of 
a  daeeuplad  way.  What  will  happen  If  tha  two  ayacaaa 
ara  eonnaccad?  Will  Ita  aoluclon  converge  In  aoaa 
aenaa  to  tha  aolutlon  aaclafylng  tha  original 
rapraaantation?  Wa  anawar  aoaa  of  Chaaa  quaaclona 
below. 

Flrat  aaa«»a  that  tha  *faae*  ayataa  la  realized  by 


€Xj  -  1.(X2+U). 


Thla  la  an  obaarwar  for  a  ayataa  with  dynaalca 
dxi(t)/dt  -  0  .  Tha  factor  «  la  uaad  to  Indicate  that 
tha  obaarwer  gain  l/«  la  actually  vary  large.  We  will 
quantify  It  later.  If  one  puca  thla  faat  ayataa 
between  tha  black  and  tha  white  hola,  tha  overall 
cranafarfuneclon  of  tha  two  Integrator  ayataa  la 


*2(a)  -  Uu(a)/[<a^-l) 

A 

Clearly  thla  la  different  froa  tha  -au(a)  that  would 
be  required  to  hava  differentiation  froa  u(t)  to  y(t). 
However  one  can  aaka  thla  work.  If  one  can  argue  that  « 
la  aueh  aaallar  than  the  aodulua  of  \/*  for  the 
fraquanelaa  of  Intaraat.  Thla  talla  ua  at  once  that  to 
eonaldar  tha  behaviour  at  Infinity,  e  cannot  approach 
tare  Independently  froa  tha  way  that  a  approachaa 
infinity! 


Practically,  Chla  aaana  now  chat  If  u(t)  has 
highest  frequency  coaponenta  “  2»fQ,  ^e 

laplanantatlon  auac  ba  "faithful’  up  to  frequency 
Froa  the  above  discussion,  this  requires 

ea  l/(*ir^f5) 


Tha  laplaaencaclon  la  Chen 


*2  -  *1 

Xj^  -  L(X2+u)/€ 


which  has  cransferfunecion 


H,(8)  -  8Vtl^*a2] 

Faithfulness  yields  then  Chat 

H,(a)  -  a 

The  above  heuristic  Ideas  thus  aeea  to  work. 
However  soaa  aora  quantitative  work  and  precise 
definitions  will  be  needed.  In  particular  it  was 
fro*  tha  above  axaapla  that  In  singular 
probleas,  a  regularization  should  ba  defined  for  which 
the  ayataa  bacoaaa  purely  <^n>aalcal.  As  usual  thla  la 
dona  with  soae  e-paraaecer  which  one  lata  tend  to  zero. 
It  la  hare  howavar  that  problaa*  arise.  The  bahaviour 
whan  a  goaa  to  Infinity  la  highly  dapandant  on  how  < 
approachaa  to  zaro  in  relation  to  a.  In  normal 
machsMClcal  parlance,  whan  on*  consldara  axtanalona, 
on*  consldara  aaquances.  Introduces  a  notion  of 
aqulvalenca,  and  than  considers  the  equivalence  classes 
as  Che  axtandsd  sat,  wherein  tha  original  sat  la  nicely 
tmbaddad  by  napping  an  alamenc  x  fron  the  original  sat 
to  the  saquanc*  (x,x,x,x, . . . ) .  The  construction  la  all 
too  fanlllar  froa  the  construction  of,  for  Instance, 
the  reals  fron  tha  raclonals.  Hare  one  Idanclfies 
Cauchy  aaquances  approaching  tha  sane  llaic,  l.a.  tha 
equivalence  claaa  of  all  such  Cauchy  saquanc**  define* 
the  real  nunbar.  Howavar,  thla  equivalence  la  too 
coarse  for  aon*  application*.  Indaad,  using  Cauchy 
saquaneas,  tha  notion  of  ’rata  of  eonvarganca*  la  loat. 
as  the  aaquances  (1/n)  and  (1/n  )  define  the  sane  real 
(0),  but  are  clearly  ranarkably  dlatlnguishabl* . 

It  la  In  thla  aana*  also  that  the  singular  syscens 
ara  Unit  point*  of  Cauchy  saqxiancas  of  regular 
systans.  Such  Id***  have  already  been  uaad  by 
Hlnrlchaan  and  O'Halloran  In  [8],  generalizing  the 
Ides  of  Young,  Kokotovlc  and  Dckln  In  [1*1  on  high 
gain  feedback.  Only  the  behavior  at  Infinity  la 

aa8>lguous.  Thla  need  not  b*  so.  If  one  taka*  the  rata* 
of  eonvarganca  of  thasa  Cauchy  saquaneas  Into  account, 
as  wa*  clearly  shown  In  tha  above  differentiator 
axaapla.  Indaad  for  aoaa  rate*  the  bahaviour  at 

infinity  will  ba  Identical  to  that  of  tha  singular 
systaas  a*  daacrlbad  In  [111-  laplaaantatlon*  (l.a. 
Cauchy  aaquanca*  of  regularized  realizations)  with  such 
lapllad  rates  of  convergence  will  b*  called  faithful. 

Clearly  then  we  ara  faced  with  tha  problaa  of 
charactarlzing  a  singular  systaa  aa  a  Cauchy  saquanc* 
of  regular  syataaa,  while  retaining  the  Information 
regarding  the  rata  of  eonvarganca,  pertaining  to  the 
atruetura  at  Infinity.  Wa  need  clearly  nor*  structure 
In  our  aqulvalenca  than  is  usually  Implied.  It  Is  now 

known  that  also  tha  real  line  contains  auch  nor* 

structure  than  Is  usually  lapllad.  Tha  reals  can  be 
labeddad  In  the  ’hyparraals’ .  Hyparraala  can  ba 
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Identified  with_the  equivalence  classes  of  sequences  of 
reals  <a|^,a2.  •T.>.  Two  sequences  ace  considered 
equivalent  If  they  agree  a.e.  (In  a  well  specified 
■easure,  baaed  on  Che  notion  of  an  ulcraf llcac) .  The 
hyperraals  contain  chan  the  Inflnlcaaslaally  saall  and 
chair  Inverses,  Che  Infinitely  large  nuobars. 
Nonstandard  analysis  seems  therefore  Co  be  Che  right 
framework  Co  analyze  the  realization  problem  for 
singular  systems.  But  before  galloping  coo  far.  It 
must  be  mentioned  Chat  everything  Chat  can  be  shown 
with  nonstandard  analysis  can  also  be  shown  using 
conventional  Cools,  but  at  the  axpensa  of  some  more 
%rock.  Nonstandard  analysis  maraly  provides  a 
convenient  language.  First  lac  us  consider  again  the 
axaispla  of  the  differentiator  In  order  to  shad  some 
more  light  on  this  Idea. 


The  first  one  has  the  Smich-Mcrflllan  form 

■  1/X  O' 

.0  *  . 

thus  displaying  a  zero  at  i  -  0.  Tha  (-Implementation 
has  Che  Smlch-HcMlllan  form 

■  1/i  0 

.  0  (a2_,/l)/X 


Structure  at  Infinity  of  the_dlfferentl«tOt 


Faithfulness  at  A  -  0  requlras  that  A^  -  (/L  'behaves* 
as  A^,  Implying  In  Cum  chat  (  goes  to  zero  faaCar  chan 
A^. 


Consider  an  aquation  dacomposltlon  form  of  the 
dlffecanclacor 


The  zero  structure  of  Che  singular  rapraaancaclor-  la 
daCemined  by  Che  matrix 


■  1 

0  ' 

r  •  m 

*1 

■  0 

1  ' 

■  *1  ■ 

+ 

■  0  ' 

U 

'  sE  -  A  B  ' 

.  0 

0  . 

• 

*2  J 

.  1 

0  . 

.  *2  . 

.  1  . 

C  0  . 

y  -  [0  -1]  [x^,X2l' 


For  instance  for  the  Input  decoupling  zeros,  we  find 


and  Its  associated  regularized  rapraaancaclon 


[sB  -  A  B] 


1 

0 


0 

L 


y  -  to  -11  txi,X2l' 


idiich  has  full  rank  for  all  finite  s,  implying  finite 
controllability  and  reachability.  AC  Infinity,  using  a 
MSblus  cransformaclon  s  •  1/A,  the  Invariant  factors 
are  found  to  be  1/A  and  1,  Indicating  the  absence  of  an 
Ixipuc  decoupling  zero.  Incldently,  also  note  chat 
(E,B]  has  full  rank,  so  chat  the  re^lzatlon  la  also 
reachable  at  infinity  [9]tl3]. 


Their  polar  structures  are  respectively  given  by  [11] 


How  let  us  Cura  Co  the  regularized  representation. 
The  eoncrollablllcy  pencil  Is  now 


sE  -  A  - 


s  -1 

.-1  0. 


[sE  -  A  B] 


s  -10 

-L  (S  L  . 


A  A 

sE  -  A  - 


■  s  -1 
-L  (S. 


and  sines  L  ^  0,  the  lattar  has  the  sans  (finite)  zero 
atruecure  as  s  -  0.  Tha  zaro-acruecura  at  Infinity  la 
decscmlnad  by  the  zero  scruecura  at  A  -  0  of 
respaeclvaly 


and 


(1/A)E  -  A 


(1/*)  -1 
-1  0  . 


(1/A)E  -  A  - 


■  (1/1)  -1  ■ 
-L  */\ 


snd  it  has  obviously  Che  same  finite  zero  scruecura  as 
Che  singular  pencil,  as  long  as  «  converges  to  zero. 
The  infinite  zero  structure  Is  now  obtained  from  tbs 
zero  structure  at  A  -  0  of  the  reduced  form 


1/1 


10  0' 
0  A  ( 


2 

since  we  already  established  that  <  «  A  ,  this  shows 
Chat  Che  A-raallzatlon  has  no  Input  decoupling  zero  at 
Infinity. 

In  fact  note  Chat  at  Infinity,  the  ayatam  matrix 
of  Che  singular  system  and  the  realization  have  the 
same  zero  scruecura  (l.a.  no  zero  at  Infinity),  however 
chair  singular  scruecura  differs. 
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and  backaubsclcute 


Motivated  by  the  analysis  of  the  differentiator  In 
our  previous  section,  ve  now  turn  to  the  discussion  of 
representations  and  realizations.  Slnply  stated  a 
rtSll8fttl9n  or  Ireolementatlon  should  aean  a  causal 
Implanantaclon,  say  with  Integrators.  Therefore,  the 
derivatives  of  all  state -variables  oust  appear  on  the 
left  hand  side  of  an  equation  with  coefficient  one, 

i  *-  ^realization  ~  ^  r<ipr9g9ntatl9n  or  dt5&tlB£ign 

Is  nothing  but  a  set  of  equations  (dynaolc  and 
algebraic)  that  oust  be  satisfied  by  the  variables  In 
the  discourse:  Inputs,  outputs  and  the  x'a.  Ua  refer 
to  these  also  as  the  generalized  system  equations. 

Obviously  representations  (descriptions)  cary  the 
saoe  Inforaatlon  under  restricted  system  equivalence 
[8].  On  the  other  hand  restricted  equivalence  cannot 
allowed  for  realizations  In  the  above  sense.  A 
pseudo  realization  form  Is  obtained  by  writing  the 
generalized  system  equations  In  the  so-called  Second 
Equivalent  Form  [4] ,  which  decouplea  the  dynamical 
equations  from  Aa  algebraic  constraints.  This 
decomposition  reflects  the  physical  meaning  of 
Interconnected  regular  subsystems . 


•  *1  -1 

“  ^^11  ""  ^12^22^21^*1  ^^1  *  42282)u 


y  “  (Cj  —  C2A22A22)^1 


In  this  section  we  describe  the  dynamical  solution 
of  a  sec  of  linear  equations  y  >  Ax,  trtiare  A  Is  sqtxare 
and  Invertible.  This  Is  Indeed  a  special  case  of  the 
representation,  having  no  dynamical  elements,  but 
containing  2n  ’states*.  However,  we  can  model  this  by 
some  pseudo-realization 


y]  [Py  + 
.oj  [ly  - 


*1  *11  *12  1 r  *1  ]  r  ®i 

“  +  u 

0  J  L  *21  *22  J  L  *2  -I  L  *2  . 


and  thus  the  algebraic  subsystem  has  dimension  n. 

The  < -realization  yields  for  the  fast  subsystem  the 
“observer* 


In  contrast,  the  Velerscrasa-form  or  First- Equivalent 
Form  seems  to  be  more  useful  for  characterizing  Che 
solutions  of  the  generalized  system,  and  Its  associated 
observability  and  reachability  properties.  An 
equivalence  leaving  the  above  structure  Invariant  is 
obtained  by  left  and  right  multiplication  by 
respectively 


X  -  -LAx/e  +  ly/e 


which  has  the  solution 


x(t)  -  A‘V  +  axp(-iAt/<)  (X  -A‘V) 


■'^11 

’>12' 

and 

0  ■ 

.  0 

«22. 

^22  . 

where  (TJ, ,  , 
V12  and  U21 


V22)  6  Cl(nj^)xGl(n2) ,  and 
arbitrary. 


Now  obtain  an  Implemencable  realization  from  the 
pseudo  realization  by  regularization.  Because  of  its 
form  we  refer  to  this  as  the  c -realization.  This 
simply  means  that  an  <X2  Is  placed  where  the  zero 
appears  In  the  pseudo  realization.  In  view  of  the 
equivalence  described  above  ve  shall  Just  consider  that 
algebraic  set  of  equations  Is  replaced  by 


For  Instance,  the  choice  L  ••  A'  yields  a  balanced 
realization  (CV],  If  one  lets  e  •  1.  In  this  case  we 
actually  have  no  reference  for  how  fast  *fast*  really 
Is,  since  there  Is  no  dynaisical  eqation.  This 
balancedness  Is  with  respect  to  the  dlscurbablllty  due 
to  the  measurement  error,  and  the  reconscruccablllty  of 
y  from  x,  thus  respectively  the  equations 


X  -  -A'Ax  -  A’y 


X  -  -A'Ax 


y  «  Ax 


^2  *  L(A22^2  ^  *21*1  ^  82^*^/* 


But  notice  that  this  Is  nothing  but  an  observer,  with 
Infinitely  _large  gain  L/« ,  for  the  system  with  state 
X2,  and  (A,C)  -  (0,-A22)i  receiving  an  “output- 

■•••urement*  y  «  *21*1'*'^1^  This  “fast*  system  Is 
observable  If  ”  *>2,  l.e..  If  A22  has  full  rank. 

Since  this  rank  Is  Invariant  under  the  above  described 
equivalence,  lack  of  observability  of  the  fast  system 
cannot  be  overcome,  unless  a  second  regularization  la 
brought  into  the  picture,  e.g.  A22  — >  A22*>‘l,  thus 
making  the  system  (0,A22'*’4*1)  observable.  Notice  that 
8*"*'f*H’**<l  systems  for  which  A22  has  full  rank  are  In 
fact  redundant  representations  of  regular  systems, 
since  we  can  always  solve  for 


*2  *  ""*22^*21*1  ■^82'*^ 


Their  associated  reachability  and  observability 
grammlans  equal  HI. 


Conclusions 

This  paper  characterized  the  implamantabillty  of  a 
generalized  system  In  terms  of  the  ability  to  build  an 
Implementation  that  assymptotlcally  remains  faithful  to 
the  properties  of  the  original  description.  The 
practicality  of  this  lies  in  the  fact  that  all  physical 
signals  are  inherently  bandllmlted,  and  the 
description,  more  particularly  Its  structure  at 
Infinity,  gives  then  an  Idea  of  how  “fast*  practical 
Implementations  should  work.  This  way  the  tortoise  can 
outrun  the  hare  I  Rates  of  convergence  are  Important, 
and  therefore  we  propose  to  use  the  language  of 
nonstandard  analysis.  This  has  actually  not  bean  dona 
yet  In  this  preliminary  version  of  the  paper,  as  ve 
were  merely  Interested  In  damonstatlng  the  feasibility 
of  an  Idea.  For  the  purpose  of  an  orientation  In  this 
field,  an  appendix  on  nonstandard  analysis  Is  Included. 
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Appendix:  Nonatandard  Analyst* 


Nonstandard  analysis  Is  a  modem  approach  to  using 
Inflnlcesslmals  In  analysis  to  express  limits  and  Its 
derived  notions.  The  theory  Is  originated  by  Abraham 
Robinson  and  modeled  after  Leibnitz's  theory  of 
Inflnltesslmals.  Its  notion  of  ‘Infinitely  close*  Is 
useful  In  representing  limits,  not  Just  on  the  real 
axis,  but  also  In  a  topological  sense,  and  even  in 
contexts  where  the  notion  Is  not  exactly  topological. 
This  Is  the  notion  which  will  make  It  useful  to  study 
the  theory  of  singular  systems. 

The  essential  Ingredient  of  the  nonstandard  theory 
Is  the  observation  that  the  real  line  allows  for  a  much 
richer  structure  than  It  Is  usually  endowed  with. 
Uhereas  classically  the  reals  are  defined  as  Dedaklnd 
cuts  or  Cauchy  sequences,  the  richer  structure  Is 
obtained  by  a  similar  procedure,  but  using  Instead  the 
notion  of  a  free  ultrafllter.  Hence,  besides  the  usual 
reals,  which  will  be  called  standard,  the  new  set  of 
hyperreals  will  also  contain  additional  "nonstandard" 
'elements.  Intuitively  speaking  the  new  elements  build 
up  a  universe  of  Inflnltesslmals  near  each  standard 
real.  Every  element  In  this  universe  is  Infinitely 
close  to  the  given  real.  Infinitely  close  means  that 
the  distance  Is  smaller  than  any  nonzero  ordinary 
standard  real.  The  fact  that  this  Is  all  brought  on 
firm  logical  foundations,  makes  the  rules  for 
manipulating  Inflnltesslmals  rigorous.  But  not  only 
are  ‘Infinitely  small*  numbers  brought  in,  the 
hyperreals  also  contain  the  ‘Infinitely  large*  numbers, 
the  Inverses  of  the  inflnltesslmals.  Once  the 
struemre  of  the  hyperreals  Is  defined,  it  Is  possible 
to  speak  of  nonstandard  functions,  operators,  and  other 
machematlcal  objects  in  the  same  vain.  Horeover,  all 
Che  theorems  of  Che  ordinary  standard  mathematics  apply 
In  this  enriched  universe,  provided  of  course  chat  they 
are  appropriately  Interpreted.  This  property  Is 
referred  to  as  the  Transfer  Principle,  or  Leibnitz's 
Principle,  as  ha  proposed  that  all  Inflnltesslmals 
should  obey  Che  same  rules  as  ordinary  standard  numbers. 

Transfer  Principle 

The  weak  form  of  the  transfer  principle  postulates 
chat  for  every  standard  formula  F(x)  having  no  other 
free  variables  chan  x,  we  have 

F(x)  -  Vx  F(x) 


or,  equivalently, 

3x  F(x)  •  3*^  F(x) 

where  V*St  F(x)  means  Vx  [sc(x)  <•  F(x)l  and  3*'x  F(x) 
means  3x(sC(x)  and  F(x) ] .  Here  sc(x)  indicates  that  x 
Is  standard,  l.e.  an  alamenc  In  the  usual  discourse  of 
mathematics . 

Of  course  it  is  also  possible  to  project  the 
hyperreals  on  the  reals.  The  operation  Is  referred  to 
as  Caking  the  Standard  Part.  This  standard  part  Is 
alto  defined  for  ocher  machemaclcel  objects.  A  typical 
approach  In  nonstandard  analysis  Is  to  obtain  a 
continuous  standard  object  from  a  discrete  nonstandard 
object.  Inflnlcary  machenaclcs  Is  obtained  from 
flnlcary  machemeclcs  (the  nonstandard  construction). 
The  complete  nonstandard  solution  to  a  standard  problem 
consequancly  Involves  first  of  all  a  ‘lifting*  of  the 
given  standard  problem  Co  a  nonstandard  one,  of  idilch 


It  Is  the  standard  part.  Next  comas  the  nonstandard 
solution  which  Is  usually  flnlcary.  Then  one  shows 
chat  the  standard  part  to  the  solution  exists ,  emd  chat 
this  solves  also  the  standard  problem.  Free  movemanC 
between  the  standard  world  and  the  nonstandard  one  Is 
allowed  by  Che  ‘Transfer  Principle*,  and  the  "Standard 
Parc*  map.  Such  an  approach  eliminates  much  of  Che 
burden  of  modem  mathematical  rigor,  since  It  deals 
rather  simply  and  In  a  more  naive  way  with  the 
Inflnlcesslmals  and  the  Infinitely  large.  In  order  to 
Illustrate  Its  power,  consider  the  criterion  for 
continuity  In  nonstandard  analysis: 

Given  f:R  — >  R,  a  standard  function  and  x^  a 
standard  real  number.  Then  f  Is  continuous  at  Xq  if 
and  only  If 

V4-0  f(Xg+«)-f(x„) 

The  equivalence  >  Identifies  numbers  Chat  are 
InflnlCessimally  close,  l.e.  x>y  Is  equivalent  to 
stating  that  x-y  Is  Inflnlcesslmal. 

Of  course  It  has  the  disadvantage  chat  the  language  Is 
not  yet  common,  especially  to  Che  nonspeclallsc  who 
should  be  using  It. 

A  very  readable  Introduction  Co  nonstandard 
analysis  Is  Che  recent  book  (In  French)  by  Dlener  and 
Reeb  [5].  An  approach  via  nonstandard  analysis  Co 
probability  Is  for  Instance  described  In  [10] .  It 
contains  a  remarkably  simple  and  concise  Introduction 
to  nonstandard  analysis.  Applications  to  singular 
perturbation  theory  of  ordinary  differential  equations 
are  described  In  [2]  and  [16].  As  far  as  this  author 
knows,  Che  only  attempt  to  introduce  nonstandard 
analysis  In  systems  theory  was  [3],  In  the  context  of 
Instantaneous  scabtllzablllcy  and  Its  robustness 
properties.  The  recent  book  [1]  on  nonstandard 
analysis  has  a  chapter  devoted  to  differencial 
operators.  Its  Introduction  deals  with  calculus, 
topology  and  linear  spaces  In  nonstandard  machamaclcs. 

It  Is  this  authors  aim  to  see  If  by  rephrasing  Che 
singular  system  problem  in  the  language  of  nonstandard 
analysis,  soma  of  Che  obscurity  presently  cloaking  the 
theory  cannot  be  allminscad,  and  thus  Its  applicability 
enhanced  to  a  wider  comnunlcy. 
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Abstract 

For  a  class  of  randomly  switched  linear  systems  a 
transition  functional  is  introduced.  It  is  shown  that 
the  expectation  of  this  function  at  time  t.  satisfies  a 
hyperbolic  partial  differencial  equation,  which  plays  a 
similar  role  as  the  backwards  Kolmogorov  equation  for 
diffusions.  Its  formal  adjoint  leads  to  the  forward 
equation,  and  their  complexity  is  determined  by  the  Lie 
algebra  associated  with  Che  sec  of  values  assumable  by 
the  dynamic  matrix  A(C,ij).  The  usual  PARABOLIC 
Kolmogorov  equation  is  derived  from  this  as  a  limiting 
case.  The  result  leads  to  Monte-Carlo  simulation 
methods  for  solving  hyperbolic  PDE's. 

Introduction 

In  this  paper,  the  class  of  stochastically 
switched  systems  is  considered.  where  Che  system 
parameters  are  piecewise  constant,  and  assumes  only  a 
finite  number  of  values,  i.e.  Kc.u)  e  . .  .Z^) . 

The  applications  of  such  models  are  widespread:  from 
target  cracking,  where  the  parameter  change  occur  aa 
changes  in  acceleration,  bank  angle  etc.,  Co  fault 
tolerant  control,  the  different  modes  being  associated 
with  different  failure  modes.  These  hybrid  systems 
have  also  been  used  as  approximations  for  certain 
nonlinear  systems  [li  In  this  paper,  H  -  2,  but  the 
generalization  is  straightforward  although  of  increased 
coisplexlty.  The  switching  phenomenon  is  assumed  to 

occur  at  purely  random  times,  and  its  stochastics 
stationary.  Let  Che  probability  of  a  switch  in  an 

infinlcessimal  interval  of  length  dt  be  Adt.  If  N(c} 
denotes  the  number  of  switches  in  the  finite  interval 
[0,c),  then  it  is  well  known  that  the  probability  that 
N(t)  equals  k  is  given  by  Che  ubiquitous  Poisson 
formula 

Prob  IN(c)-k)  -  o~^'^(Ac)'‘AI  d) 


will  be  called  a  transition  funccionel  From  ,>r» 

"zero"  reflects  the  time  at  the  InitLnl  •••/•nr. 
of  Che  scationarlcy  of  the  switching  process.  T.r,  ,oss 
of  generality  results  from  this.  The  set  of  r.rsr.sl  •  :or. 
functionals  characterizes  the  state  tr-or.s  i ;  or. 
completely,  and  therefore  the  complete  evolution  of  the 
driven  system  as  well.  Here,  u(c)  is  some  auxlllsr/ 
signal,  for  instance  the  input.  If  u(tj  «  0,  ar.d  e 
selects  the  i-th  component  of  its  argument,  Chen  the 
transition  functional  tp  is  the  1-Ch  component  of  the 
translation  operator  [4J. 

Main  Results 

Consider  Che  alternating  undriven  linear  system 

dx(t)/dc  ■■  A(C.w)x(C) ,  (3) 

where  the  random  matrix  \(C,u)  assumes  the  values  A, 
and  A^  according  to  a  random  switching  process.  For 
instance  the  matrix  process  A(C)  is  patterned  after  a 
telegraph  signal,  based  on  the  number  K(C)  of  random 
points  in  Che  interval  [0,C).  If  this  number  is  even, 
Chen  A(C}  »  A.^,  whereas  if  it  is  odd,  Chen  A(C)  ••  A,. 

Define  also  Che  averaged  dynamic  matrix  and  its 
"excursion*  respectively  by  •  (A.^+A.)/2 
and  0  -  (A^-A.)/2.  An  approximate  solution  of  the 

random  system  Is  first  obtained  by  discretizing  the 
time  in  steps  of  length  d,  and  letting  Che  switching  be 
commensurate  with  Chase  sampling  times.  The  matrix 
A(nd)  can  be  expressed  as 

Ag  +  . .  «]^Q,  (4) 

where  is  a  Bernoulli  (1,-1)  process,  with  Pr((— 1)  - 
AA.  A  switching  at  step  k  corresponds  then  to  -  -1. 
Hence  Che  stats  x(C),  if  A(0)  -  A^,  is  the  limit  for 
A-t/n  -►  0  of 


Furthermore,  If  the  switching  times  are  ordered,  uj  < 
Cj  <  ...  <  Cjj.  then  the  increments  N(C2)-N(C,),  N(c.j)- 

N(C2) .  N<t|j)-N(C|j. j^)  are  Independent.  Our  method 

generalizes  a  result  by  Kac  (21,  where  the  one- 
dimensional  motion  of  a  particle  with  constant  speed  v, 
but  wit!,  randomly  (at  Poisson  Claes)  reversing 
direction  was  considered,  i.e,  Che  first  order  affine 
system  dx/dc  6  (v,-v|.  In  his  paper,  the  expectation 
of  any  function  of  the  position  was  Incarpraced  as  a 
solution  to  the  wave  equation,  but  with  a  "random  path 
time"  substituted  for  the  real  time,  followed  by 
averaging  over  all  paths.  This  led  to  Monte-Carlo 
methods  for  solving  the  related  Kleln-Corden  equation, 
which  is  significant  in  quantum  electrodynamics,  and 
factors  to  the  Dirac  equation  (3J. 

First  we  define  the  notion  of  a  general  transition 
functional:  u'  R  where  is  a  smooch 

map  (linear  or  nonlinear)  from  Che  state  space  a"  to 
a,  and  F^  implicitly  defined  by 


-  ip(x(C)) 

dx(c)/dc  -  f(x(c),u(c),t)  ;  x(0)  -  x 


(2) 
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fI+(Ao+e„s„.i. .  .SiO)AH1+(Ao+*„.i<„.2-  ■  ‘iQIAJ  . .  . 
...[I+(Ag+ej^O)A]Xg 

The  state  transition  matrices  P'*'  and  P'  follow 

pi  -  ii+(A„±.„.„.i....in)A)[i>(A„±.n.i.„.2....in)A]...^ 

...[I+<Ao±.iO)A]. 

The  expected  transition  functionals  are  now  obtained  by 

F^(x)  -  E  ,»(P^] ,  (6) 

and  by  caking  the  expectations  over  separately  one 
derives  the  recursions 

f;;(x)  -  (1-AA)F*.2((I+(Ao40)a)x1  +  AAF;;.i[(I+(Ao-fl)A)xl 

F'(x)  -  (l-AA)F'.i(<I+(Ao-0)A)xl  +  AAF^.^t (I+(Ao+0)A)xl 

By  reorganizing  the  terms,  and  caking  limits  for  A  -*  0, 
one  obtains  the  partial  differencial  system 

0/ac)Ftx,t)  -  (a/ax)Ftx.t)A.x  +  A[F(x.t)-Ftx,c)l 

(8) 

(a/3t)F(x,t)  -  (3/ax)F(x,c)A.x  ♦  A(Ftx,t)-F(x,c)J 
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Upon  setting 


G  -  [F'*’+F')/2  (9) 

H  -  (F'*’-F']/2  (10) 

this  is  equivalent  to  the  system  of  PDF's 

ac/at  -  (ac/ax)  a  x  +  (an/ax)  ox 

(11) 

aH/at  -  (aH/ax)  a^x  +  (ac/ax)  ox  -  2iH 

Its  initial  conditions  follow  directly  from  the 
definition  of  G(x,t).  Indeed,  G{x.O)  “  Es>(x)  -  (p(x) , 
since  randomization  over  A(0,u)  Just  gives  the 
identity.  The  second  set  of  conditions  follows 
similarly  from  H(x,0)“0.  This  then  proofs  the 
following  theorem. 

Theorem  1.  The  evolution  F^(x,t)  of  the  randomly 
switched  system  (3)  Is  given  by  the  following  pair  of 
PDF’s 

aF^at  -  (aF,p/ax)  a^x  +  (aH/ax)  ox 
aH/at  -  (aH/ax)  a^^x  +  (aF^ax)  ox  -  2ah 
with  initial  conditions  F^(x,0)-v(x)  and  H(x.O)  -  0. 

The  coupled  set  of  PDF's  plays  the  role  of  the 
backwards  Kolmogorov  equation  ^5^.  Introducing  the 
first  order  differential  forms  3^  «  3/3t  ,  3^  “ 
x'fl'(a/ax)',  ai.d  dg  -  x'A^(a/ax)',  the  following 
special  case  can  be  deduced. 

Corollary  2.  If  A^  and  A.  commute,  then  the  evolution 
of  the  system  (3)  is  governed  by  the  hyperbolic  PDF 

(3^*2X-3^)(3^-3^)T^-  (3j\  (13) 

with  initial  conditions  Pp(x,0)-v>(x)  and  3jF^(x,0)-0. 

Proof;  Indeed,  if  A,^  and  A.  commute,  then  so  do  A^  and 
ft.  But  then  the  differential  operators  3^  and  3^ 
commute,  and  upon  elimination  of  H(x,t)  one  obtains  the 
PDF  (13) .  That  the  Initial  conditions  are  as  stated 
follows  also  from  the  main  theorem. 

If  p*  and  p'  represent  the  conditional  density 
(assuming  it  exists)  of  x(t)  given  x(0)-x,  and  A(0) 
respectively  A^  and  A. ,  then  it  follows  from 

3/3t<p'*’ ,<p>  -  <p'*’,  [x'AJ.(a/ax)-A)v>  <s'.As>> 


same  effect  as  a  diffusion.  The  drift  is  however  NOT 
the  one  given  by  th^  averaged  dynamics  A^,  but  an 
additional  drift  O^x  is  present.  This  can  be 
stabilizing  or  destabilizing,  depending  on  (}„.  For 
instance  if  (ig  has  imaginary  eigenvalues,  stabilization 
may  occur,  since  0^,^  has  then  negative  eigenvalues.  If 
on  the  other  hand  one  has  high  frequency  switching,  but 
p  remains  finite,  then  the  stochastic  energy  (the 
Integral  of  ||A(t)A(t+T) '  j)  is  zero,  and  the  dynamics  of 
the  averaged  system  is  all  that  remains. 

In  the  noncommutative  case  a  higher  order  PDF  for 
F  is  obtained.  Its  structure  depends  on  the  dimension 
of  Che  Lie  algebra,  generated  by  A^  and  A..  For 
instance  if  the  commutator  of  A,^  and  A,  is  nonzero,  but 
commutes  with  both,  then  it  is  known  [6]  that  G 
satisfies  a  third  order  PDF. 

For  A-0,  the  equations  are  readily  solved  in  terms 
of  the  characteristics  which  are  exactly  the 
deterministic  evolutions  according  to  the  different 
modes.  For  A>>0,  Che  solutions  of  the  PDF  are  still 
interpreted  in  terms  of  the  characteristics,  but  via  a 
’randra  time  operator*  [6], 

Summary  and  Extensions 

It  was  shown  that  for  randomly  alternating 
systems,  a  hyperbolic  system  of  first  order  PDF’s 
describes  the  behavior  of  the  system.  From  this,  a 
single  higher  order  PDF  results  throu^  elimination  of 
Che  auxiliary  variables.  The  mechanization  of  this 
elimination  process  and  its  ensuing  complexity  is 
determined  by  the  Lie  algebra  generated  by  the  A(t) 
values.  By  using  the  formal  adjoint,  this  can  be 
interpreted  that  under  some  smoothness  assumptions  the 
density  satisfies  a  type  of  forward  Kolmogorov  or 
Fokker' Planck  equation,  which  in  this  case  is  also  of 
hyperbolic  type.  It  was  shown  chat  asymptotically,  the 
parabolic  equations  of  diffusion  type  result,  if  the 
limits  are  taken  in  such  a  way  that  Che  stochastic 
energy  is  conserved  in  the  limiting  system.  The 
results  presented  here  were  for  linear  autonomous 
systems,  but  extend  easily  to  the  nonlinear  driven  case 
with  markovian  switching  between  a  countable  number  of 
models. 

Finally,  one  can  reverse  the  ideas  and  develop 
stochastic  solution  methods  for  hyperbolic  PDF’ s  as  was 
done  for  parabolic  and  elliptic  ones  based  on  Dynkln's 
equation  [5].  Indeed,  such  Honte  Carlo  simulation 
methods,  are  not  based  on  Che  stochastic  evolution  in 
the  (narrow)  ICo  sense  (l.e.  based  on  an  underlying 
Brownian  process),  but  on  a  counting  process. 


-  <-3/3x(/i^xp*)-Xp'*' ,,p>  +  <Xp'  .<p>  (U) 
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3p*/3t  -  -'^■{K^xp*)-\(p*-p') 
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The  density  of  x(C)  is  finally  obrsired  by 
[p*{x,t)*p'{x.t)\/2. 


(15) 


/i(x.c)  - 
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PARABOLIC  PDF  results,  which  is  the  equivalent  to  the 
Ito-differenCial  system  where  u(t)  is  a  Wiener  process 
with  Fw(t)w(s)-<3  min(C.s). 

dx  -  (Ag+ng)x  dt  +  OgX  dw(c)  (16) 

Clearly,  the  •jlttarlng"  caused  by  very  fast  switching 
over  very  large  amplitudes  in  the  direction  has  the 
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Abstract 

In  this  paper  we  study  the  dynamics  of  time- 
homogeneous  Markov  chain  models  from  a 
state-space  modeling  point  of  view.  It  is  shown 
that  a  Markov  chain  model  can  be  embedded  in 
a  2-D  realization  theory  where  markov  pa¬ 
rameters  correspond  to  higher-order  transition 
probabilities.  The  implications  of  formulating 
a  Markov  chain  model  in  this  state-space  do¬ 
main  is  that  many  equivalent  representations 
may  exist,  some  of  which  may  have  better  ro¬ 
bustness  properties.  A  modified  Hankel  ap¬ 
proximation  algorithm  is  presented  which  ex¬ 
actly  matches  all  the  Markov  parameters.  The 
algorithm  is  an  extension  of  the  2-D  harmonic 
retrieval  algorithm  introduced  in  [6]. 

1.  Introduction 

Markov  chain  models  have  been  used 
extensively  to  model  random  phenomena  with 
a  particular  type  of  dependence;  the  Markov  de¬ 
pendence.  A  stationary,  finite  state  Markov 
chain  is  defined  as  a  stochastic  process  having 
a  finite  number  of  states,  the  Markovian  de¬ 
pendence,  stationary  transition  probabilities, 

and  an  initial  set  of  probabilities  {7Cj(0))j!!o. 
Such  a  process  is  said  to  be  memoryless;  the  fu¬ 
ture  behavior  depends  only  on  its  present  state 
and  not  on  its  past  history.  Hence,  only  a  lim¬ 
ited  amount  of  information  is  required  to  prop¬ 
agate  the  conditional  distribution  of  a  Markov 
process.  Such  Markov  structure  arises  in  con¬ 
nection  with  decision  making  under  uncer¬ 
tainty  [1],  queueing  theory  [2],  hidden  Markov 
models  [3],  stochastic  dynamic  programming 
[4],  and  the  solution  of  linear  algebraic,  inte¬ 
gral,  and  differential  equations  [51,  to  name 
only  a  few. 

Although  Markov  chains  have  the  con¬ 
cept  of  state  and  Markov  propagation  property 
embedded  in  it,  there  does  not  seem  to  be  any 
connection  with  the  state-space  formulation  of 
linear  dynamical  systems.  Having  an  equiv¬ 
alent  linear  systems  theory  for  Markov  chain 
models,  one  can  select  a  canonical  representa- 
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tion  that  is  best  suited  for  implementation  or 
perhaps  is  less  sensitive  to  word  length  effects, 
limit  cycles,  etc.  Our  main  concern  here  is  to 
develop  a  2-D  realization  theory  that  yields  a  set 
of  equivalent  state-space  representations  for  a 
given  Markov  chain  model.  The  2-D  state- 
space  model  corresponds  to  a  pair  of  row/col¬ 
umn  state  equations  which  propagate  the  transi¬ 
tion  probabilities  in  space,  and  a  temporal  state 
equation  that  propagates  them  in  time.  The  ob¬ 
servations  correspond  to  the  higher  order  tran¬ 
sition  probabilities  and  thus  can  be  seen  as 
lower  level  Markov  parameters.  Finally,  the 
parameters  of  the  Markov  model  are  unique 
modulo  a  similarity  transformation  of  the 
state-space. 

A  modified  Hankel  approximation  al¬ 
gorithm  is  presented  which  exactly  matches  the 
upper  level  Markov  parameters  defined  as  the 
higher  order  transition  probability  matrices. 
The  algorithm  is  an  extension  of  the  2-D  har¬ 
monic  retrieval  algorithm  introduced  in  [61.  In 
the  following  section,  we  define  some  of  the  ba¬ 
sic  properties  of  Markov  chains.  In  section  3, 
we  present  our  2-D  realization  theory  for 
Markov  chains.  Finally,  in  section  4,  we  in¬ 
troduce  the  Markov  chain  realization  algo¬ 
rithm. 

2.  Preliminaries:  Definition  and  Properties 
of  Markov  Chains 

A  stochastic  process  {^(n))  exhibits  the 
Markovian  property  if 

p{  ^(n+l)=:j  1  ^(n)=i )  =  p{  ^(n+l)=:j  1  ^(n)=i,  .  . 

^(n-  l)=:i, ,  ^(n-2)=i j . §(0)=i„  )  ^ 

for  n  =  0,1,2,  ...,  and  every  sequence  (j,i,ii,  ..., 
i,^).  This  is  equivalent  to  stating  that  the  proba¬ 
bility  of  an  event  at  time  n+1  given  only  the  out¬ 
come  at  time  n  is  equal  to  the  probability  of  the 
event  at  time  n-t-1  given  the  entire  state  history 
of  the  system.  The  conditional  probabilities 

pij(l)  =  p{^(n+l)=j|^(n)=i)  (2) 
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are  called  one-step  transition  probabilities,  and 
are  said  to  be  stationary  if 

pgd)  =  p{  ^(n+l)=j  I  ^(n)=i  1  =  p(  ^(l)=j  1 4(0)=i 
Vn*0,1.2.... 

so  that  the  transition  probabilities  remain  un¬ 
changed  through  time.  These  values  may  be 

displayed  in  a  matrix  P(l)  =  [Pij],  called  the 
one-step  transition  matrix.  The  (NxN)  matrix 
P(l)  satisfies 

OSpgCDsi  (4a) 

£  pyd)  =  1  for  0  S  i  ^  N-1  (4b) 

j-o 

The  existence  of  one-step,  stationary 
transition  probabilities  implies  the  existence  of 
higher-order  transition  probabilities,  which 
can  be  computed  from  the  Chapman-Kol- 
mogorov  equations,  i.e., 

N-l  ■ 

Pij(h+S)=  X  Pl.»(k)Pmj(s)  .•  (5) 

maiO 

V  k,  s  =  0,  1,  2,  ...  and  0  5i,j5N-L  Here 

Pij(k)  =  p{  ^(n+k)=j  1  ^(n)=i  1  are  called  kth-step 
transition  probabilities,  and  may  be  displayed 

in  a  kth-step  transition  matrix  P(k)  s  [pij(k)], 
where,  in  general 

0 S Pg(k) 51  V  k  s  0, 1, 2, ...  ..  . 

and  05ij5N-l 

X  py(k)  =  l  V  k  =  0,l,2,  ... 
j«o  (6b) 

and  05i5N-l 

It  should  be  noted  that  pij(O)  =  Sij  (Kronecker 
delta),  thus,  P(0)  =  In  (NxN  identity  matrix). 

The  unconditional  probability  of  {^(n)) 
being  in  state  j  at  time  n  =  k  is  given  by 

Jtj(k)  =  p{  ^(k)=j  1  =  X  ni(0)ptj(k) 

(7) 

Vk  =  l,2,...  and  05j5N-l 

and  in  row  vector  form 

«(k)  =  [  ji,(k)  iii(k)  xa(k)  ...  kn-A)  ]  (8) 

In  general,  for  irreducible,  ergodic  Markov 

chains,  the  steady-state  probabilities  Itj  are 
independent  of  i,  i.e., 

lim  Pij(k)  =  lim  „/k)  =  n, 
k-»«»  k-^o* 


and  satisfy  the  following  conditions: 


0<  7^  5  1 

(9a) 

II 

(9b) 

N-l 

X  Kipij(l) 

i-0 

(9c) 

and  0  5j5N-l 

Finally,  for  P=  lim  P(k) ,  X=1  is  the  only 
k  — » “o 

nonzero  eigenvalue  and  n  and  1  =  [  1  1 ...  1  f 
are  its  left  and  right  eigenvectors,  respectively. 
The  interested  reader  may  consult  references 
[7],[81  for  further  details  on  Markov  chains. 

3.  2-D  realization  Theory  for  Markov 
Chains 

Consider  a  2-D  state-space  model  such 
as 

Xi*ij(k)  =  AiXij(k)  (10a) 

xu»i(k)  =  AaXij(k)  (10b) 

Pij(k)  =  cxij(k)  (10c) 

where  xij(k)  is  an  (Nxl)  state  vector.  At  and  As 
are  (NxN)  constant  matrices,  c  is  a  (IxN)  vec¬ 
tor,  and  Pij(k)  is  a  scalar  measurement  corre¬ 
sponding  to  the  (ij)th  element  of  the  kth-order 
transition  matrix.  The  dynamics  of  the 
Markov  chain  can  be  incorporated  by  allowing 
the  state  vector  to  vary  with  transitions,  i.e, 

Xij(k+1)  =  Wxij(k)  (11) 

where  W  is  an  (NxN)  transition  matrix.  In 
addition  we  assume  that  Ai  and  Aa  are  stability 

matrices  and  the  pairs  (Ai,  c)  and  (As,  x^(0)) 
are  observable  and  controllable,  respectively. 
If  we  recursively  solve  the  state  equations  (10a), 
(11),  and  (10b),  and  substitute  them  in  (10c),  we 
find  that 

Pij(k)=cAiW‘‘Ai3u..(0)  (12) 

corresponds  to  the  Markov  parameters  of  the  2- 
D  model  (lO)-(ll).  However,  it  should  be  clear 
from  our  2-D  model  that  the  matrices 
Ai,  As,  and  W  must  commute  with  each  other. 
As  we  will  see  later,  the  constraints  imposed  by 
P(0)  does  not  allow  this  commutativity  property 
to  hold.  This  imposes  a  constraint  on  the  order 
in  which  the  state  equations  can  be  updated.  In 
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order  to  avoid  any  confusion  with  this  partial 
ordering,  we  rewrite  state  equations  (10b)  and 
(11)  as 


x.j«i(k)  =  Aax.j(k)  (10b') 

x.j(k+l)  =  Wxoj(k)  (11*) 

Notice  that  (11’)  implies  x,j(k+s)  =  W^Aix.,,(s). 
Also,  the  order  of  state  updates  is  j-k-i  (column, 
time,  row),  which  leads  to  the  Markov 
parameters  (12). 

The  solution  to  the  Chapman-Kol- 
mogorov  equations  yield  the  higher-order 
transition  probabilities,  i.e., 

N-l 

Ptj(k+s)  =  X  Pi.»(lc)Pmj(s) 

msO 

=  5^  cAiA“x...(k)cAr'Aix,..(s) 

msO 

=  cAlW  j  £  A?xU0)cAr 

=  cAiW'‘wl(0)Aix...(s) 

=  cAiAix,.,(k+s) 

where 

W«(0)  =  £  Aj'x.,.(0)cAr  (14) 

nwO 

can  be  thought  of  being  a  cross  Grammian  [9] 
having  joint  observability  and  controllability 
properties  and  satisfying  the  following  Gram¬ 
mian  equation 


Proof:  Recall  that  P(0)  =  OC(0)  =  In,  where 


c 


0  = 


cAi 

cAf 


cAf 


N-l 


C(0)  =  [  x...(0)  Aax...(0)  A|x...(0)  -  -  -  Ar'x,..(0)  ] 


and  by  the  observability  and  controllability  as¬ 
sumption  (p(0)  =  p(C(0))  =  N),  O  =  C(0)‘\  Fur¬ 
thermore,  0x^(0)  =  ei  (the  first  element  of  the 
standard  basis  in  R'')  and  OAix^(O)  =  0,  thus 
Aix,^(0)  =  0  implies  that  X4m>(0)  is  an  element  of 


N.p(A,).  In  fact,  spanf  )  =  N^AO.  A 

II  WO)  ir 

similar  argument  implies  cA3C(0)  =  0^  and  thus 
c^  is  an  element  of  the  left  null  space  of  Aa.  The 
normalization  comes  from  p^O)  =  cXaJiO)  =  1. 
To  prove  property  (ii)  we  need  to  make  use  of  the 
Cayley-Hamilton  Theorem  which  states  that 

A(Ai)  =  A(As)  =  0  (characteristic  polynomial), 
i.e.,  V  OsjsN-l 

AJ*  =  -  aN-iAi*"’  -  on-jAi*’* . OoIn 

cArAix...(0)  =  -  a^OAix...(0)  (16) 

=  -o%  =  0 


W..(0)  =  A2W„(0)A,  +  x...(0)c  (15) 

We  remark  that  (15)  requires  N  to  be  large  or 
A"  =  Aa  =  [0]  V  m  SN.  In  the  following  theorem 
we  prove  the  latter  case,  along  with  other 
properties  from  (Ai,  Aa,  W,  c,  x,.,(0))n. 

Theorem  1:  Given  an  Nth -order  2-D  realize* 
tion  (Ai,  Aa,  W,  c,  x^(0))n,  the  following  proper¬ 
ties  have  to  be  satisfied  in  order  for  it  to  charac¬ 
terize  a  Markov  chain: 

i)  x,.,(0)  e  Nip(Ai)  and  c’’  €  NhXAJ) 

such  that  cx.«(0)  =  1 

ii)  Ar  =  A3  =[0]  V  mSN 

iii)  AaAi  =  In  -  3W0)c 

iv)  p(Ai)  =  p(Aa)  =  p(AiAa)  =  N-l 

v)  X4(A,)  =  X*(Aa)  =  0  ;  k  =  l,2,...,N 
and  X(AaA,)  =  (l,  1,  1,  -.,1,0) 

vi)  AiAaAi  =  Ai  and  AaAiAa  =  Aa 

vii)  Ai,  Aa,  and  W  cannot  commute 

with  each  other 


thus,  cA^CfO)  =  0^  and  since  c’’  e  NgpfAj),  it  fol¬ 
lows  that  Ai*  =  [0].  A  dual  argument  can  be  used 
to  show  that  Aa  =  [01  Property  (iii)  follows  from 

the  fact  that  Wc.(0)  =  C(0)O  =  In  and  the  use  of 
(15).  To  prove  property  (iv)  we  need  the  follow¬ 
ing  identities  from  [10,  pp.  140  - 1411: 

I  l-cx.*(o)|  =  |  In-x,..(o)c| 

I  x,„(o)c  -  pIn  1  =  (-p)"'’!  cx«,.(o)  -  p| 

which  implies  that  p=l  is  the  only  nonzero 
eigenvalue  of  x.«(o)c,  therefore,  the  eigenvalues 
of  AaAi  are  X.  =  (p  - 1)  =  1  with  multiplicity  (N-l) 

and  X.  =  0.  The  same  holds  true  for  A|Aa,  hence, 
rank(AiAa)  =  N-l.  Now,  since 

p(AiAa)  S  min[p(Ai),  p(Aa)]  S  N 
N-l  S  min[p(Ai),  p(Aa)]  S  N 

we  know  that  the  lower  bound  is  satisfied  since 

the  dimension  of  the  null  spaces  of  Ai  and  Aa  is 
at  least  one  (it  is  indeed  one),  therefore. 
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the  left  by  m  columns.  Then  by  the  previous 
eigenvalue-eigenvector  properties,  one  can 
show  that  N(N-l)  rows  of  0(Ai,  Aa,  c)  and 

columns  of  C(Ai,  Aa,  are  repeated.  This 

proves  properties  (i)  and  (iii)  since  rank(O)  s 
rank(C(0))  =  N  by  definition  of  I^).  To  prove 
property  (ii)  it  is  ea^  to  show  that  H  can  be  con¬ 
structed  so  that  column  blockQ+l)  =  P(l)column 
blockCi)  and  row  block(i-fl}  s  P(l)row  block(i). 
Therefore,  P(0)  is  the  only  block  in  H  that  is  in¬ 
dependent  of  the  others,  and  is  of  full  rank.  Q 

4.  MarkovChainRealizationAlgorithin 

Given  P(l),  we  can  form  the  Hankel 
matrix  using  2K-2  Markov  parameter  matrices 


from  P(k)  =  m)\ 

i.e.. 

P(0) 

P(l) 

P(K-l) 

P(l) 

P(2) 

P(K) 

H  = 

c 

.  P(K.l) 

P(K) 

P(2K-2)  J 

OC(0) 

OWC(O) 

..  ow''  'c(0)  ■ 

s 

OWC(O) 

OW’C(O) . 

..  OW*C(0) 

1 

0 

9i 

0 

0 

OW*C(0). 

..  OW®*C(0). 

s 

oc 

(20) 

where  P(k)  is  given  by 


The  Markov  chain  realization  algorithm  con¬ 
sists  of  a  pair  of  upper/lower  level  steps  to  de¬ 
termine  [O,  C(0),  W]  and  [Ai,  Aa,  c,  x^(0)]n.,  re¬ 
spectively.  Both  steps  can  be  achieved  through  a 

singular  value  decomposition  (svd)  of  H,  i.e., 

H  =  urv^=oc 

q  =  U2“  (23) 

where  U  is  a  (KNxN)  orthonormal  matrix,  £  is 
an  (NxN)  diagonal  matrix  containing  the 
Hankel  singular  values,  and  V  is  a  (KNxN) 
orthonormal  matrix.  The  parameters  are  ob¬ 
tained  from 

Upper  Level  Parameters:  [0,  C(0),  W] 

0  =  first  (NxN)  block  of  O 
C(0)  =  first  (NxN)  block  of  C  (24) 

w  =  [oTod’oJo,  = 

where  Of  consists  of  the  first  N(K-l)  rows  of  O 
and  O)  of  the  last  NOC-l)  rows  of  0.  The  same 
definition  applies  to  C  . 

Lower  Level  Parameters:  [Ai,  Aa,  c,  x«,.(0)]n 

c  *  first  (IxN)  row  of  O 
x,..(0)  =  first  (Nxl)  column  of  C(0)  ^25) 

A,  =  C(0)O^ 

Aa  =  CTO)0 


cW'x,^(0) 

cW'AaxUO) 

...  cW‘A2~'x,..(0) 

cA,W‘x^(0) 

cAiW%x.^(0) 

...  cAiW‘A2'’x^(0) 

P(k)  = 

cAfW^x^fO) 

cA?W^Aax^(0) 

...  cA?W’‘A?‘xU0) 

=  OC(k) 

(21) 

.  cAr-'W WO) 

cAr‘W"AaX^(0) 

...cAr'W%'^‘x^(0). 

and  represents  a  lower  level  set  of  Markov  pa¬ 
rameters.  Similarly,  the  upper  level  observ¬ 
ability  and  controllability  matrices  are 

C=[C(0)  WC(0)W*C(0)...  W''  'C(0)  ]  (22a) 

O 

0  w 

0=1  ow*  I  (22b) 


K-l 


OW^' 


where  O^  is  equal  to  O  shifted  upwards  by  one 
row,  the  last  one  being  a  (IxN)  row  of  zeros 

since  Al*  =  [01;  and  C(0)  is  C(0)  shifted  to  the  left 
by  one  column. 

We  should  point  out  that  since  p(H)  =  N, 
OChas  Hankel  structure,  therefore,  the 
following  Hankel  norm  property  is  satisfied 


II  H  -  0C||  =  (26) 

which  is  of  the  order  of  machine  precision. 
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p(Ai}  =  pCAa)  =  N-1.  Property  (v)  follows  from 
properties  (ii)  and  (iv)  since  every  nilpotent 
matrix  has  all  its  eigenvalues  equal  to  zero. 
Property  (vi)  follows  from  properties  (i)  and 
(iii),  which  implies  that  Ai  and  Aa  are  g-in- 
verses  of  one  another.  To  prove  property  (vii) 
we  need 


cAa 

OAa=  cA| 


Ca,(0)=[xo.o(0)  AiXo.o(0)  ...  A^'xo.o(0)  ] 

and,  if  we  recall,  Oai  =  O  and  CAa(O)  =  C(0). 
Now,  suppose  P(0}  =  OAaCAi(O)  =  In,  then  one  can 
show  that  cAaCAiCO)  =  0^  ^  ej  since  c'^  e  N«p(Aa}, 
therefore,  AiAa  *  A3A1.  Furthermore,  it  can  be 
shown  that  cAiAa  =  c  and  AiA2X«^0)  =  x«^(0), 
thus,  we  have  cWAiA2X,j,(0)  =  cWx«..(0),  for 
instance.  A  similar  argument  shows  that  W 
cannot  commute  with  A3  either.  Q 

We  now  establish  the  equivalence  be> 
tween  Markov  chains  characterized  by 
[n(0),  P(1)]n  and  a  state-space  realization  char¬ 
acterized  by  [Ai,  Aa,  W,  c,  x,^(0)]n. 

Theorem  2:  A  Markov  chain  defined  by 
[n(0),  P(1)]n  is  equivalent  to  a  2-D  state-space 
realization  [Ai,  Aa,  W,  c,  x^(0)]n  provided  this 
one  satisfies  the  properties  of  Theorem  1. 

Proof;  Since  we  know  that  P(k)  =  P(l)‘‘’  we  can 
use  these  as  Markov  parameters.  Let  us  now 
form  the  Hankel  matrix  from  these  higher- 
order  transition  matrices,  i.e.. 


there  is  another  Nth  dimensional  realization 

TAi,  Aa,  W,  c,  x^(0)]n  that  satisfies  Theorem  1, 
i.e.,  then  it  can  be  brought  to  the  above  canoni¬ 
cal  form  by  a  similarity  transformation,  i.e., 

T  s  C(0)O.  Hence,  the  two  realizations  are 
equivalent  in  the  sense  of  [Ai,  Aa,  W,  c,  Xm(0)]n 

=  [TA,T*‘,  TAaT*,  cT-’,  If  we 

recall  from  the  previous  section,  the  initial 
probabilities  are  used  in  a  state  equation  such 
as 

jtflc)  =  «(0)P(k) 

(18) 

=  n(0)OW  C(0) 


z(k)  =  z(0)W  (19) 

where  z(k)  =  Jt(k)0.  Then  if  we  apply  the 
similarity  transformation  to  (19),  i.e., 

Wk  =  T  ’,  we  get  J(k)T-‘  =  z(k)  =  jr(k)0  = 
ic(k).  This  shows  that  the  two  type  of  systems 

[ir(0),  P(1)]n  and  [Ai,Aa,  W,  c,  x^(0)]n  carry  the 
same  information.  Q 

Theorem  3:  Given  an  Nth  order  2-D  realization 

[Ai,  Aa,  W,  e,  x«.»(0)]n  that  satisfies  the  properties 
of  Theorem  1,  the  following  properties  are 
equivalent; 

i)  (Ai,  Aj,  c)  and  (Aj,  Aj.  x.^0))  are 
observable  and  controllable 

ii)  p(H)  *  p(P(0))  =  N 

iii)  p(0(Ai,  Aa,  c))  =  p(0(Ai,  c))  =  N 
p(C(Ai,  Aa,  xo.o(0)))  =  p(C(A2,  x«,a(0)))  =  N 

Proof;  One  can  show  that  the  global 
observability  and  controllability  matrices  [11] 
have  the  following  structure 


H=  P(2)  [P(0)  P(l)  P(2)...P(K-l)]  =  OC 


OA*  Oi 
0(Ai,Aa,c)=  oaI  =  Oa 


Lp(k-i)J 

Then  O  =  P(0),  C(0)  =  P(0),  W  =  P(l),  c  =  [1,  0,  0, 
t  •- 

...,0],  x^O)  s  c^,  Ai  =  In,  and  Ai »  In  (arrows  de¬ 
note  shifted  identity  matrices)  can  be  shown  to 
satisfy  the  properties  of  Theorem  1.  Suppose 


C(Ai,  Aa,  x^O))  =  [  C(0)  AiC(O) . .  •  Ar  ’CfO)  ] 
=[C(0)Ci(0)C2(0).-.  Cn-i(O)] 

where  0.  denotes  the  observability  matrix 
shifted  downwards  by  m  rows  (padded  with  zero 
rows).  Similarly,  Cn(0)  denotes  C(0)  shifted  to 


5.  Conclusioiis 

We  have  presented  a  2-D  realization 
theory  for  Markov  chains  which  yields  an  exact 
representation.  It  was  shown  that  the  Markov 
parameters  of  the  2-D  realization  exactly  match 
the  higher-order  transition  probability  matri¬ 
ces  of  the  Markov  chain.  Since  the  model  is 
obtained  from  a  "balanced”  ^i>e  (in  this  case 
optimal)  realization  algorithm,  one  should  ex¬ 
pect  the  robustness  properties  inherent  in  these 
algorithms.  Moreover,  a  parametrization  of  the 
2-D  realizations  presented  here  may  lead  to 
canonical  structures  for  certain  probability 
matrices,  i.e.,  birth-death  chains,  queueing 
chains,  etc..  Another  potential  application  is  in 
the  identification  of  Markov  chains  from  given 
data.  These  issues  and  other  extensions  are 
currently  being  investigated  and  will  be  re¬ 
ported  elsewhere. 

References 

[U  Derman,  C.  Finite  State  Markovian 
Decision  Processes,  Academic  Press, 
New  York,  1970. 

[2]  Ross,  S.  M.,  Introduction  to  Probability 
Models,  Academic  Press,  New  York, 
1972. 

[3]  Rabiner,  L.  R.,  "A  tutorial  on  hidden 
Markov  models  and  selected  applica¬ 
tions  in  speech  recognition".  Proceed¬ 
ings  of  the  IEEE,  Vol.  77,  No.  2,  Febru¬ 
ary  1989,  pp.  257-286. 

[4]  Howard,  R.  A.,  Dynamic  Program¬ 
ming  and  Markov  Processes,  M.I.T. 
Press,  Cambridge,  Massachusetts,  1960. 

[51  Rubinstein,  R.  Y.,  Simulation  and  the 
Monte  Carlo  "Method,  John  Wiley  & 
Sons,  New  York,  1981. 

[6)  Rao,  D.  V.  B.  and  S.  Y.  Kung,  "A  state- 
space  approach  for  the  2-D  harmonic 
retrieval  problem".  Proceedings 
ICASSP  84,  San  Diego  California,  pp. 
4.10.1-4.10.3, 1984. 

[71  Chiang,  C.  L.,  An  Introduction  to 
Stochastic  Processes  and  Their  Applica¬ 
tions,  Krieger  Publishing  Co., 
Huntington,  New  York,  1980. 

[81  Doob,  J.  L.,  Stochastic  Processes,  John 
Wiley  &  Sons,  New  York,  1953. 


[9]  Fernando,  K.  V.  and  H.  Nicholson, 
"Minimality  of  SISO  linear  systems," 
Proceedings  of  the  IEEE,  Vol.  70,  pp. 
1241-1242,  October  1982. 

[101  Brogan,  W.  L.,  Modem  Control  Theory, 
Quantum  Publishers,  Inc.,  New  York, 
1974. 

[11]  Attasi,  S.  ,"Modelling  and  recursive 
estimation  for  double  indexed  se¬ 
quences",  in:  System  Identification: 
Advances  and  Case  Studies,  ed.  R.  K. 
Mehra  and  D.  Lainiotis,  Academic 
Press,  New  York,  pp.  289-348, 1979. 


